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NOVEL IMMUNOADHESINS FOR TREATING AND PREVENTING TOXICITY 
AND PATHOGEN-MEDIATED DISEASES 



RELATED APPLICATIONS 

5 This application claims priority as a continuation-in-part application of Larrick 

andWycoff, International Patent Application Ser. No. PCT7US01/13932, filed April 28, 
2001, and entitled NOVEL IMMUNOADHESIN FOR THE PREVENTION OF 
RHINOVIRUS INFECTION, which in turn claims priority to United States Provisional 
Application Ser. No. 60/200,298, filed April 28, 2000, and entitled the same. Each of 
10 these applications is herein incorporated by reference in its entirety, including all figures, 
drawings, and sequence listings. 

STATEMENT REGARDING FEDERALLY SPONSORED 
RESEARCH OR DEVELOPMENT 

Federal research support was provided in the form of an SBER Phase I grant (R43 
15 AI43122) and SBIR Phase II grant (2R44AI43 122-02). 

FIELD OF THE INVENTION 

The present invention relates to immunoadhesins, their production from plants, and 
their use in the treatment and prevention of toxicity and pathogen-mediated ailments such 
as anthrax and the common cold. 

20 BACKGROUND OF THE INVENTION 

The common cold is generally a relatively mild disease. However, significant 
complications resulting from colds, such as otitis media, sinusitis and asthma 
exacerbations are common. Human rhinoviruses (HRV) cause up to 50% of all adult colds 
and 25% of colds in children (Bella and Rossmann, J Struct Biol 128:69-74, 1999, and 
25 Sperber and Hayden, Antimicrob Agents Chemother. 32:409-19, 1988). The cost to 
society runs into billions of dollar per year. These small, nonenveloped RNA viruses 
represent a subgroup of picornavirus (Rueckert, Virology \ pp. 507-548, eds. Fields, et aL, 
Raven Press, Ltd. New York, 1990) X-ray crystallography of rhinovirus identified a capsid 
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300 A in diameter (1 A = 0.1 nm) with icosahedral symmetry, constructed from sixty 
copies each of the viral coat proteins VP1, VP2, and VP3 (Rossmann, Nature 
317:145-153, 1985). A surface depression or "canyon" on HRV was suggested as the 
receptor binding site (Colonno, etal, Proc Natl Acad Sci USA. 85:5449-5453, 1985; 
5 Rossmann, et al Nature 317:145-153, 1985). Of the 102 characterized HRV serotypes, 91 
(known as the major group) share as their receptor a cell surface glycoprotein known as 
intercellular adhesion molecule-1 (ICAM-1) (Greve, et al, Cell 56:839-847, 1989; 
Staunton, et al, Cell 56:849-853, 1989); the binding site is located within N-terminal 
domain 1 (Greve, et al, J Virol 65:6015-6023, 1991; Staunton, et al, Cell 61 :243-254, 
10 1990). 

ICAM-1 is a membrane protein with five extracellular domains, a hydrophobic 
transmembrane domain, and a short cytoplasmic domain. ICAM-1 is expressed on many 
cells important in immune and inflammatory responses, and is inducible on others 
(Casasnovas, etal, Proc Natl Acad Sci USA. 95:4134-9, 1998). ICAM-1 functions as a 
15 ligand for the leukocyte integrins LFA-1 and Mac-1 (Springer, Cell 76:301-14, 1994; 

Staunton et al, Cell 61:243-254, 1990). On the cell surface, ICAM-1 is primarily a dimer 
due to association of the transmembrane domains (Miller, et al, J Exp Med. 182:1231-41, 
1995; Reilly, et alJ Immunol 155:529-32, 1995). 

Recombinant, soluble forms of ICAM-1 (sICAM-1) consisting of the five 
20 extracellular domains were shown to be effective in blocking rhinovirus infection of 

human cells in vitro (Greve, et al, J Virol 65:6015-6023, 1991; Marlin, et al, Nature. 
344:70-2, 1990). Evaluation of sICAM-1 activity against a spectrum of laboratory strains 
and field isolates showed that all major strains of HRV are sensitive to sICAM-1. Minor 
strains, which do not use ICAM as a receptor, were unaffected by sICAM-1 (Crump et al, 
25 Antiviral Chem Chemother. 4:323-327, 1993; Ohlin, et al, Antimicrob Agents Chemother. 
38:1413-5, 1994). 

The anti -viral activity of soluble ICAM-1 in vitro appears to be mediated by more 
than one mechanism. These mechanisms include competition with cell-surface ICAM-1 
for binding sites, interference with virus entry or uncoating, and direct inactivation by 
30 premature release of viral RNA and formation of empty capsids (Arruda, et al, 
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Antimicrob Agents Chemother. 36:1 186-1 191, 1992; Greve, et aL, J Virol. 65:6015-6023, 
1991; Marlin, et aL, Nature 344:70-2, 1990; Martin et aL, J Virol. 67:3561-8, 1993). 

The host range of HRV is restricted to primates. A recent study showed that 
soluble ICAM-1 was effective in preventing rhino virus infection in chimpanzees 
5 (Huguenel, et aL, Am J Respir Crit Care Med. 155:1206-10, 1997). Although 

chimpanzees do not show clinical symptoms, infection was demonstrated by measuring 
seroconversion and virus shedding. A single dose of 10 mg of soluble ICAM-1 as an 
intranasal spray was effective at preventing infection by HRV- 16 when co-administered 
with HRV, or when the virus was administered ten minutes later. 

10 A human clinical trial with soluble ICAM-1 showed that it reduced the severity of 

experimental HRV colds (Turner, et aL, JAMA 281:1797-804, 1999). In this trial a total of 
196 subjects received either soluble ICAM-1 or placebo in various formulations. Some 
subjects were given soluble ICAM-1 or placebo starting seven hours before inoculation 
with HRV 39 and others were started twelve hours after virus inoculation. Medications 

15 were administered as either an intranasal solution or powder, given in six daily doses for 
seven days (a total of 4.4 mg per day). In this study, soluble ICAM-1 did not prevent 
infection, as measured by either virus isolation or seroconversion (infection rate of 92% 
for placebo-treated vs. 85% of soluble ICAM-1 treated). However, soluble ICAM-1 did 
have an impact on all measures of illness. The total symptom score was reduced by 45%, 

20 the proportion of subjects with clinical colds was reduced 23% and nasal mucus weight 

was reduced by 56%. There was not a significant difference between the use of powder or 
solution formulations, or between pre- and post-inoculation groups. Treatment with 
soluble ICAM-1 did not result in any adverse effects or evidence of absorption through the 
nasal mucosa. Also, there was no inhibition of the development of anti-HRV type-specific 

25 antibodies. 

As discussed, ICAM-1 is dimeric on the cell surface. Martin et aL, in J Virol. 
67:3561-8, (1993) first proposed that multivalent binding to HRV by a multimeric soluble 
ICAM might result in a higher effective affinity, termed avidity, and thus facilitate 
uncoating of the virus. They constructed multivalent, ICAM-1 /immunoglobulin 
30 molecules, postulating that these would be more effective than monovalent soluble 

ICAM-1 in neutralizing HRV and thus would have increased therapeutic utility. These 
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ICAM-1 /immunoglobulin molecules included ICAM-1 amino-terminal domains 1 and 2 
fused to the hinge and constant domains of the heavy chains of IgAl (ICl-2D/IgA), IgM 
(ICl-2D/IgM) and IgGl(ICl-2D/IgG). In addition, five extracellular domains were fused 
to IgAl (ICl-5D/IgA). These ICAM-1 /immunoglobulin molecules were compared with 
5 soluble forms of ICAM-1 having two (sICl-2D) and five (sICl-5D) domains in assays of 
HRV binding, infectivity and conformation. The ICAM-l/IgA immunoglobulin 
(ICl-5D/IgA) was 200 times, and the ICAM-l/IgM immunoglobulin (ICl-2D/IgM) and 
ICAM-1 /IgG immunoglobulin molecules (ICl-2D/IgG) were 25 and 10 times, more 
effective than soluble ICAM-1 . These molecules were highly effective in inhibiting 
10 rhino virus binding to cells and disrupting the conformation of the virus capsid. The 

ICAM-l/IgA immunoglobulin molecules were effective in the nanomolar concentration 
range. Comparison of ICl-2D/IgA and ICl-2D/IgG showed that the class of Ig constant 
region used had a large impact on efficacy. 

A subsequent study compared the inhibitory activities of soluble ICAM-1 and 

15 ICl-5D/IgA against nine major HRV serotypes and a variant of HRV-39 selected for 
moderate resistance to soluble ICAM-1 (Crump, et aL, Antimicrob Agents Chemother. 
38:1425-7, 1993). ICl-5D/IgA was more potent than monomelic soluble ICAM-1 by 50 
to 143 times on a weight basis and by 60 to 170 times on a molar basis against the 
standard serotypes. The HRV-39 variant was 38-fold more resistant to soluble ICAM-1 

20 than the wild-type, and it was only 5-fold more resistant to ICl-5D/IgA. This is consistent 
with the hypothesis that virus escape from inhibition by multivalent molecules would be 
expected to occur at lower frequency than virus escape from inhibition by monomeric 
soluble receptor (Martin, et aL, J Virol. 67:3561-8, 1993). An assay designed to measure 
viral inactivation showed that HRV-39 and HRV- 13 were not directly inactivated to a 

25 significant extent by soluble ICAM-1 (<0.5 logio reduction in infectivity). However, 

incubation with ICl-5D/IgA resulted in a reduction of infectivity of these same viruses by 
about 1.0 logio (Crump, et aL, Antimicrob Agents Chemother. 38:1425-7, 1994). Results 
by Martin et al. {J Virol. 67:3561-8, 1993) suggest that the greater the valence, the greater 
the effectiveness of the molecules. Dimeric and decameric forms of ICl-2/IgM were 

30 separable by sucrose gradient sedimentation. The decameric form was five times more 
effective than the dimeric form at blocking binding of HRV to HeLa cells. 
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The ICAM-1 /immunoglobulin molecules that have been described suffer from 
several drawbacks, including the laborious production techniques and high costs 
associated with those production methods. In addition, the previously described ICAM- 
1 /immunoglobulin molecules have limited stability as multimers in the harsh environment 
5 in which the molecule must inactivate rhinoviruses. 

Applicants' previous, commonly owned application, International Application Ser. 
No. PCT/US01/13932, described the construction, purification, and use of chimeric 
immunoadhesin molecules, with examples and claims directed to treating or preventing 
viral infections and diseases, e.g., the common cold. There is a need for similar agents for 
10 the treatment and prevention of toxicity and other pathogen-mediated diseases and 

ailments, e.g., bacterial infections and diseases, such as anthrax. The bioterrorism scare 
following September 11, 2001 underscores this need. 

The tripartite toxin secreted by Bacillus anthracis, the causative agent of anthrax, 
helps the bacterium evade the immune system and can kill the host during a systemic 

15 infection. Two components of the toxin enzymatically modify substrates within the 

cytosol of mammalian cells: oedema factor (OF) is an adenylate cyclase that impairs host 
defences through a variety of mechanisms including inhibiting phagocytosis; lethal factor 
(LF) is a zinc-dependent protease that cleaves mitogen-activated protein kinase kinase and 
causes lysis of macrophages. Protective antigen (PA), the third component, binds to a 

20 cellular receptor and mediates delivery of the enzymatic components to the cytosol. After 
binding to the cell-surface receptor, PA is cleaved into two fragments by a furin-like 
protease. The amino-terminal fragment, PA 2 o, dissociates into the medium, and this 
allows the carboxy-terminal fragment, PAe3 to heptamerize and bind LF and OF. The 
resulting complexes of [PA63]7 with OF and/or LF are taken up into cells by receptor- 

25 mediated endocytosis and moved to a low-pH endosomal compartment. There, the acidic 
environment induces a conformational change in [PA 6 3]7 that allows it to insert into the 
membrane and form a pore. This conversion promotes the translocation of bound OF and 
LF across the endosomal membrane to the cytosol. 

The immunoadhesins of the present invention may be tailored to combat any 
30 pathogenic agent or poison and has significant advantages over what has been described 
in the art. The immunoadhesins of the present invention that are expressed in plants 
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would be tetrameric, rather than only dimeric. Immunoadhesins having multiple binding 
sites have a higher effective affinity for the pathogen/toxicant, thereby increasing the 
effectiveness of the immunoadhesin. In addition, the association of secretory component 
and immunoglobulin J chain with the immunoadhesin of the present invention increases 
5 the stability of the immunoadhesin in the mucosal environment (Corthesy, Biochem Soc 
Trans. 25:471-475, 1997). Secretory IgA, which is associated with secretory component, 
is the antibody isotype normally found in mucosal secretions, including milk and 
colostrum. Unlike other antibody isotypes, SIgA can pass through the gut with very little 
proteolytic degradation. It also is very stable in crude plant preparations at room 

10 temperature. A function of the secretory component appears to be to protect the antibody 
from the harsh environment of the mucosa (Paul, Fundamental Immunology, Raven Press, 
NY, Third Edition, pp. 303-304, 1993). Furthermore, the immunoadhesins of the present 
invention are significantly less expensive to produce in plants than in animal cell culture, 
and production in plants would make it safer for human use, since plants are not known to 

1 5 harbor any animal viruses. 

The preceding discussed documents as well as those which follow may be useful in 
understanding the invention but are not admitted to be prior art to the invention: 

Baumlein H, Wobus U, Pustell J, Kafatos FC (1986) The legumin gene family: structure of a B 
type gene of Vicia faba and a possible legumin gene specific regulatory element. Nucl. Acids Res. 
20 14: 2707-2713 

Becker D, Kemper E, Schell J, Masterson R (1992) New plant binary vectors with selectable 
markers located proximal to the left T-DNA border. Plant Mol. Biol. 20: 1 195-1 197 

Bradley KA, Mogridge J, Mourez M, Collier RJ, Young JAT (2001) Identification of the 
cellular receptor for anthrax toxin. Nature 414: pre-publication 

25 Chintalacharuvu KR, Raines M, Morrison SL (1994) Divergence of human alpha-chain 

constant region gene sequences. A novel recombinant alpha 2 gene. Journal of Immunology 152: 
5299-5304 

Crump et al. (1994) Comparative Antirhino viral Activities of Soluble Intercellular Adhesion 
Molecule- 1 (sICAM-1) and Chimeric ICAM-1 Immunoglobulin A Molecule. Antimicrobial 
30 Agents and Chemotherapy, 38:6, p. 1425-1427 
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Depicker A, Stachel S, Dhaese P, Zambryski P, Goodman HM (1982) Nopaline synthase: 
transcript mapping and DNA sequence. J. Mol. Appl. Genet. 1: 561-573 

Gielen J, De Beuckeieer M, Seurinck J, Deboeck F, De Greve H, Lemmers M, Van Montagu 
M, Schell J (1984) The complete nucleotide sequence of the TL-DNA of the Agrobacterium 
5 tumefaciens plasmid pTiAchS. Embo J 3: 835-46 

Greve et al.(1991) EP 0468257, Multimeric form of human rhinovirus receptor protein. 

Horsch RB, Fry JE, Hoffmann NL, Eichholtz D, Rogers SG, Fraley RT (1985) A simple and 
general method for transferring genes into plants. Science 227: 1229-1231 

Ingelbrecht I, Breyne P, Vancompernolle K, Jacobs A, Van Montagu M, Depicker A (1991) 
10 Transcriptional interference in transgenic plants. Gene 109: 239-242 

MacDonald MH, Mogen BD, Hunt AG (1991) Characterization of the polyadenylation signal 
from the T-DNA-encoded octopine synthase gene. Nucleic Acids Res 19: 5575-81 

Martin et al. (1993), Efficient Neutralization and Disruption of Rhinovirus by Chimeric ICAM- 
1 /Immunoglobulin Molecules. J. of Virology, 67:6, p. 3561-3568. 

15 Mogen BD, MacDonald MH, Leggewie G, Hunt AG (1992) Several distinct types of sequence 
elements are required for efficient mRNA 3 f end formation in a pea rbcS gene. Mol Cell Biol 12: 
5406-14 

Ni M, Cui D, Einstein J, Narasimhulu S, Vergara CE, Gelvin SB (1995) Strength and tissue 
specificity of chimeric promoters derived from the octopine and mannopine synthase genes. Plant 
20 Journal 7: 661-676 

Sawant SV, Singh PK, Gupta SK, Madnala R, Tuli R (1999) Conserved nucleotide sequences 
in highly expressed genes in plants. Journal of Genetics 78: 123-131 

St Croix B, Rago C, Velculescu V, Traverso G, Romans KE, Montgomery E, Lai A, Riggins 
GJ, Lengauer C, Vogelstein B, Kinzler KW (2000) Genes expressed in human tumor 
25 endothelium. Science 289: 1 197-202. 

Yamamoto YY, Tsuji H, Obokata J (1995) 5 f -leader of a photosystem I gene in Nicotiana 
sylvestris, psaDb, contains a translational enhancer. J Biol Chem 270: 12466-70. 
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SUMMARY OF THE INVENTION 

The present invention contemplates an immunoadhesin comprising a chimeric 
molecule having a toxin receptor protein linked to at least a portion of an immunoglobulin 
heavy chain, wherein J chain and secretory component are associated with the chimeric 
5 molecule. A toxin receptor as used here in is a receptor molecule or a part of a receptor 
molecule, at least a portion of which is a protein or peptide found on the surface of or in 
the cells of a host organism, to which toxicants, e.g., poisons or pathogenic organisms 
such as viruses, bacteria, fungi, parasites (or a molecule produced by such pathogenic 
organism) etc. attach as part of the disease generating process. In a preferred embodiment 
10 the toxin receptor will be the extra-cellular domain of a receptor molecule. The toxin 
receptor may be glycosylated or non-glycosylated. Alterations and modifications to the 
receptor protein or portion thereof are also contemplated, provided such modifications do 
not destroy the ability of the receptor to bind the toxin, toxicant, pathogen, or pathogen 
component. 

15 In some embodiments in which the receptor protein is a viral receptor protein, the 

immunoadhesin of the present invention is comprised of a rhinovirus receptor protein 
made of any combination of extracellular domains 1, 2, 3, 4 and 5 of the rhinovirus 
receptor protein, ICAM-1, linked to an immunoglobulin heavy chain. Also contemplated 
by the present invention are immunoadhesins of the present invention in which the 

20 immunoglobulin is IgA, IgAl, IgA2, IgGl, IgG2, IgG3, IgG4, IgM, IgD, IgE or a 

chimeric immunoglobulin heavy chain made up of domains or segments from different 
immunoglobulin isotypes. 

In other preferred embodiments of the present invention, the immunoadhesin 
comprises multiple chimeric ICAM-1 molecules associated with J chain and secretory 
25 component. The increase in valency results in a higher effective affinity for the 
rhinovirus, thereby increasing the effectiveness of the immunoadhesin. 

In a preferred embodiment of the present invention, all proteins used to make the 
immunoadhesin of the present invention are human proteins. In addition to production in 
plants or plant cells, the present invention contemplates an immunoadhesin expressed in 
30 mammalian cells, hairy root cultures, plant cells in tissue culture, and heterologous cells 
derived from plants, vertebrates or invertebrates. 
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In preferred embodiments of the present invention, the immunoadhesins are 
expressed, in plants, including monocotyledonous plants and dicotyledonous plants as a 
part of the plants genome. Expression in plants, as opposed to expression in cultured cells, 
allows for a significant reduction in the cost of producing the immunoadhesin. 

5 The present invention contemplates an immunoadhesin having plant-specific 

glycosylation. A gene coding for a polypeptide having within its amino acid sequence, the 
glycosylation signal asparagine-X-serine/threonine, where X can be any amino acid 
residue, is glycosylated via oligosaccharides linked to the asparagine residue of the 
sequence when expressed in a plant cell. See Marshall, Ann. Rev. Biochem., 41 :673 

10 (1972) and Marshall, Biochem. Soc. Symp., 40:17 (1974) for a general review of the 

polypeptide sequences that function as glycosylation signals. These signals are recognized 
in both mammalian and in plant cells. At the end of their maturation, proteins expressed 
in plants or plant cells have a different pattern of glycosylation than do proteins expressed 
in other types of cells, including mammalian cells and insect cells. Detailed studies 

15 characterizing plant-specific glycosylation and comparing it with glycosylation in other 
cell types have been performed, for example, in studies described by Cabanes-Macheteau 
et al., Glycobiology 9(4):365-372 (1999), and Altmann, Glycoconjugate J. 14:643-646 
(1997). These groups and others have shown that plant-specific glycosylation generates 
glycans that have xylose linked (3(1,2) to mannose, but xylose is not linked (3(1,2) to 

20 mannose as a result of glycosylation in mammalian and insect cells. Plant-specific 
glycosylation results in a fucose linked a(l,3) to the proximal GlcNAc, while 
glycosylation in mammalian cells results in a fucose linked a(l,6) to the proximal 
GlcNAc. Furthermore, plant-specific glycosylation does not result in the addition of a 
sialic acid to the terminus of the protein glycan, whereas in glycosylation in mammalian 

25 cells, sialic acid is added. 

In other embodiments, the immunoadhesin of the present invention is part of a 
composition comprising plant material and the immunoadhesin, associated with J chain 
and secretory component. The plant material present may be plant cell walls, plant 
organelles, plant cytoplasms, intact plant cells, viable plants, and the like. The particular 
30 plant materials or plant macromolecules that may be present include ribulose bisphosphate 
carboxylase, light harvesting complex, pigments, secondary metabolites or chlorophyll. 
Compositions of the present invention may have an immunoadhesin concentration of 

9 



WO 03/064992 



PCT/US02/34197 



between 0.001% and 99.9% mass excluding water. In other embodiments, the 
immunoadhesin is present in a concentration of 0.01% to 99% mass excluding water. In 
other embodiments, the compositions of the present invention have plant material or plant 
macromolecules present at a concentration of 0.01% to 99% mass excluding water. 

5 The present invention also contemplates methods for the treatment or prevention of 

human rhino virus infection in a subject, including reducing the infection by human 
rhino virus of host cells susceptible to infection by the virus, or reducing the initiation or 
spread of the common cold due to human rhinovirus, by a method comprising contacting 
the virus with an immunoadhesin of the present invention, wherein the immunoadhesin 

10 binds to the human rhinovirus and reduces infectivity. The immunoadhesin could mediate 
infection by competition with cell-surface ICAM-1 for binding sites, interference with 
virus entry or uncoating, and/or direct inactivation by premature release of viral RNA and 
formation of empty capsids (Arruda, et al., Antimicrob. Agents Chemother. 36:1 186-1 191, 
1992; Greve, et al., J. Virol. 65:6015-6023, 1991; Martin, et al., Nature 344:70-2, 1990; 

15 Martin, et al., J. Virol. 67:3561-8, 1993). In another embodiment, human rhinovirus 

infection in a subject is treated by a method comprising intranasally administering to the 
subject an effective amount of an immunoadhesin of the present invention, wherein the 
immunoadhesin reduces human rhinovirus infectivity. 

Other aspects of the invention contemplate immunoadhesins, compositions, and 
20 methods of use thereof in which the immunoadhesins are active against a bacterium or 
bacteria. In such aspects, the immunoadhesins contain a receptor protein that binds a 
bacterium of interest, e.g., Bacillus anthracis, or a pathogenic component thereof, e.g., 
protective antigen (PA). In some preferred embodiments for the treatment or prevention 
of anthrax, that receptor protein is the Anthrax toxin receptor protein or a portion thereof. 
25 The portion can be an extracellular domain or a portion of that domain. Additional or 
alternative embodiments can track those already described for ICAM immunoadhesins, 
discussed above. For example, at least a portion of an immunoglobulin heavy chain, a J 
chain, and a secretory component as described above are also present in some preferred 
embodiments. 

30 In another distinct aspect the invention features a method for reducing or 

preventing the binding of toxin or pathogen (e.g., the protective antigen (PA) of Bacillus 
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anthracis) to host cells susceptible to damage by said toxin or pathogen by contacting the 
toxin or pathogen with immunoadhesins that bind to it, thereby decoying the toxin or 
pathogen, and masking and/or ameliorating its pathological effect . Other aspects feature 
methods of reducing mortality and morbity based on this concept. 

5 In poison contexts, the invention also features poison antidotes that comprise a 

poison receptor or portion thereof linked to an immunoahesin. 

While anthrax and the common cold are two enumerated targets in various aspects 
of the invention, the invention as concerns immunoadhesins generally can make use of any 
known receptor protein or portion thereof that can bindto a component of any pathogen or 

10 toxicant, which component is required by that pathogen to exert its pathogenic or toxic 
effect. In addition to natural pathogens, toxicants include but are not limited to venom, 
carcinogens, mutagens, or other metabolic inhibitors or accelerators that can have a 
negative effect on cells, tissues, organs, or organisms. The principle described herein can 
thus be used to prevent, treat, ameliorate, or modulate any type of toxicity or pathogen- 

15 mediated disease or symptom caused by such.. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 illustrates pSSPHuA2, vector in which DNAs encoding a chimeric ICAM-1 
molecule containing the first five domains of human ICAM-1 and the Fc region of human 
IgA2m(2) were fused [SEQ ID NO:9, 48]. This vector contains the SuperMas promoter 

20 for driving the expression of a signal peptide and the constant regions of the human 

IgA2m(2) heavy-chain. Sequences encoding ICAM domains 1-5 were amplified, by PCR, 
to contain convenient restriction sites (5 ' Spel and 3 * Spe I) for insertion between the 
signal peptide and the Cal domain. DNA encoding an ER retention signal (RSEKDEL) 
[SEQ ID NO:5] was appended to the 3' end of the heavy-chain in order to boost the 

25 expression level of the construct. 

FIG. 2 illustrates a chimeric ICAM molecule. 2A shows the DNA expression 
cassette from which the chimeric ICAM-1 molecule was produced. 2B shows the amino 
acid sequence, after signal peptide cleavage, of the mature form of the fusion protein [SEQ 
ID NO:8]. Amino acids introduced by the cloning procedure are underlined and mark the 
30 junction between the five extracellular domains of ICAM-1 and the Cal-Ca3 domains of 



ll 



WO 03/064992 



PCT/US02/34197 



the IgA2m(2) heavy chain. The bolded N's indicate the fifteen potential glycosylation 
sites. 

FIG. 3 illustrates the expression of the immunoadhesin in independently 
transformed tobacco calli. 3 A shows immunoblots of non-reducing SDS-polyacrylamide 
5 gels on which samples containing different transformed tobacco calli (C) and aqueous 
extracts (Aq) were run and probed for the presence of human ICAM. The molecular 
weight markers are indicated, and the reference standard (R) was a mixture (~75 ng each) 
of human ICAM (-75 kD) and human SigA (»250 kD). 3B shows immunoblots of 
nonreducing SDS-polyacrylamide gels containing various fractions of partially purified 
1 0 immunoadhesin from callus Rhi 1 07-1 1 . The purification fractions analyzed were juice (J), 
G-100 fraction (G), sterile filtered G-100 fraction (SG), and a mixture of reference 
standards of human SigA (75 ng) and human ICAM-1 (75ng) (RS). 

Blots were probed with antibodies against human ICAM (-ICAM), human IgA 
heavy chain (~a), human secretory component (~SC) and human J chain (~J). Secondary, 
15 enzyme-conjugated antibodies were employed as necessary to label immuno-positive 
bands with alkaline phosphatase. 

FIG. 4 illustrates the results of an enzyme-linked immunosorbent assay (ELISA) 
showing competition between plant extract and soluble ICAM-1 for binding to an anti- 
ICAM mAb. For the assay, 96-well plates were coated with 0.25 |xg soluble ICAM-l/ml. 
20 The squares represent the increasing concentrations of sICAM and the circles represent the 
increasing amounts of callus extract (sterile filtered fraction from G-100) used to compete 
with the adhered ICAM for a constant amount of a mouse (anti-human ICAM) antibody. 

FIG. 5 illustrates the results of an assay showing the ability of an immunoadhesin 
to inhibit human rhino virus killing of HeLa cells (cytopathic effect, or CPE, assay). 5 A 

25 shows the results of an assay comparing the CPE of human rhino virus on HeLa cells in the 
presence of partially purified extracts containing either the immunoadhesin in the ICAM- 
Fc fusion (ICl-5D/IgA) or containing an antibody against doxorubicin. (The right side-up 
and upside-down triangles represent two extracts derived from Rhi 107-1 1, containing the 
immunoadhesin.) 5B shows the results of an assay comparing the CPE of human 

30 rhinovirus on HeLa cells in the presence of soluble human ICAM-1 or an extract from the 
immunoadhesin in the ICAM-Fc fusion (ICl-5D/IgA). The Inset shows scale expansion 
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in the range of the IC50 for soluble ICAM (1.35 ^ig/ml) and for ICl-5D/IgA (0.12 |ig/ml; 
11.3 fold-less). 

FIG. 6 shows an evaluation of the production necessities for making 1 gram of 
finished immunoadhesin. In this diagram, the number of plants needed for 1 g of 
5 immunoadhesin, at 20% yield, at expected levels of expression and plant weight is 

illustrated. At different levels of immunoadhesin expression (mg/kg fresh weight) and 
overall recovery (set at 20%), the weight of each plant, and so the total number of plants, 
may be determined for a specified production target (1 g/harvest) within a window (dotted 
square) of reasonable possibilities. The number of required plants decreases, inversely, 
10 with the number of specified growth and re-growth periods. The expected biomass 

production, a function of time and growth conditions, influences the time to harvest and 
the time between harvests. These growth periods can be adjusted to the realities of the 
purification schedule by staggering planting and harvesting dates. 

FIG. 7 shows the coding and amino acid sequences of each of the immunoglobulin 
15 genes and proteins listed in Table 2 [SEQ ID NO:15 through 47 and SEQ ED NO:52 
through 62]. 

FIG. 8 shows the sequences of plasmids used to transform plants, as described in 
Example 2, for use in studies of the expression of immunoadhesins of the present 
invention. 

20 FIG. 8 A shows the nucleotide [SEQ ID NO:9] and protein [SEQ ID NO:48] 

sequences for plasmids PSSpICAMHuA2 

FIG 8 B shows the nucleotide and protein [SEQ ID NO: 10] sequence for the bean 
legumin signal peptide. 

FIG 8 C shows the nucleotide [SEQ ID NO:l 1] and amino acid [SEQ ID NO:50] 
25 sequence of the protein coding region of pSHuJ. 

FIG 8 D shows the nucleotide [SEQ ID NO:12] and amino acid [SEQ ID NO:51] 
sequence of protein coding region of pSHuSC. 

FIG 8 E shows the nucleotide sequence [SEQ ID NO: 13] of plasmid pBMSP-1. 
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FIG 8 F shows the nucleotide sequence [SEQ ID NO: 14] of plasmid pBMSP- 
lspJSC. 

FIG. 9 contains nucleotide and protein sequences SEQ ID NO:l; SEQ ID NO:2; 
SEQ ID NO:3; SEQ ID NO:4; SEQ ID NO:5; SEQ ID NO:6; SEQ ID NO:7; SEQ ID 
NO: 8, for ICAM-1, and human IgA2 and other nucleotide sequences. 

FIG. 10 shows the full nucleotide (SEQ ID NO: 98) and amino acid sequence 
(SEQ ID NO: 99) of the ATR-IgA2 fusion (an immunoadhesin). 

FIG. 1 1 shows the sequence (SEQ ID NO: 100) between the T-DNA borders of the 
plasmid pGPTV-kan-ocs-ATR-IgA2. 

FIG. 12 shows the sequence (SEQ ID NO: 101) between the T-DNA borders of the 
plasmid pGPTV-hpt-ocs-35SJ/SC. 

DETAILED DESCRIPTION OF THE INVENTION 

A. Definitions 

As used herein, the following abbreviations and terms include, but are not 
necessarily limited to, the following definitions. 

The practice of the present invention will employ, unless otherwise indicated, 
conventional techniques of immunology, molecular biology, microbiology, cell biology 
and recombinant DNA, which are within the skill of the art. See, e.g., Sambrook, et al., 
Molecular Cloning: A Laboratory Manual, 2nd edition (1989); Current Protocols In 
Molecular Biology (F.M. Ausubel, et al. eds., (1987)); the series Methods In Enzymology 
(Academic Press, Inc.); M.J. MacPherson, et al., eds. Per 2: A Practical Approach (1995); 
Harlow and Lane, eds, Antibodies: A Laboratory Manual (1988), and H. Jones, Methods 
In Molecular Biology vol. 49, "Plant Gene Transfer And Expression Protocols" (1995). 

Immunoglobulin molecule or Antibody. A polypeptide or multimeric protein 
containing the immunologically active portions of an immunoglobulin heavy chain and 
immunoglobulin light chain covalently coupled together and capable of specifically 
combining with antigen. The immunoglobulins or antibody molecules are a large family 
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of molecules that include several types of molecules such as IgD, IgG, IgA, secretory IgA 
(SIgA), IgM, and IgE. 

Construct or Vector. An artificially assembled DNA segment to be transferred into 
a target plant tissue or cell. Typically, the construct will include the gene or genes of a 
5 particular interest, a marker gene and appropriate control sequences. The term "plasmid" 
refers to an autonomous, self-replicating extrachromosomal DNA molecule. In a preferred 
embodiment, the plasmid constructs of the present invention contain sequences coding for 
heavy and light chains of an antibody. Plasmid constructs containing suitable regulatory 
elements are also referred to as "expression cassettes." In a preferred embodiment, a 
10 plasmid construct can also contain a screening or selectable marker, for example an 
antibiotic resistance gene. 

Selectable marker. A gene that encodes a product that allows the growth of 
transgenic tissue on a selective medium. Non-limiting examples of selectable markers 
include genes encoding for antibiotic resistance, e.g., ampicillin, kanamycin, or the like. 
15 Other selectable markers will be known to those of skill in the art. 

Transgenic plant. Genetically engineered plant or progeny of genetically 
engineered plants. The transgenic plant usually contains material from at least one 
unrelated organism, such as a virus, another plant or animal. 

Chimeric ICAM-1 molecule: The fusion of any combination of the extracellular 
20 domains 1, 2, 3, 4 and 5 of ICAM-1 with at least a part of an immunoglobulin heavy chain 
protein, made by linking ICAM-1 sequence upstream of an immunoglobulin heavy chain 
gene sequence and expressing the encoded protein from the construct. In antibacterial 
embodiments, one or more receptor proteins effective to bind a bacterium or bacteria of 
interest or subcomponent thereof, such as a protein produced by the bacteria and required 
25 for the bacteria to exert its pathogenic effect, are used instead of, or in addition to, ICAM- 
1 . Many such receptor proteins are known and can be implemented for use in appropriate 
aspects of the invention by those of ordinary skill in the art without exercising undue 
experimentation. An example is provided below utilizing one such protein that binds to a 
protein involved in the pathogenic mechanism caused by the bacterium that causes human 
30 anthrax. The same concept can be used to target other viruses besides rhinoviruses, e.g., 
by making use of appropriate host receptor proteins in immunoadhesin form. 
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Chimeric immunoglobulin heavy chain: An immunoglobulin derived heavy chain 
having at least a portion of its amino acid sequence derived from an immunoglobulin 
heavy chain of a different isotype or subtype or some other peptide, polypeptide or 
protein. Typically, a chimeric immunoglobulin heavy chain has its amino acid residue 
sequence derived from at least two different isotypes or subtypes of immunoglobulin 
heavy chain. 

Dicotyledonous plants (dicots): Flowering plants whose embryos have two seed 
halves or cotyledons. Examples of dicots are: tobacco; tomato; the legumes including 
alfalfa; oaks; maples; roses; mints; squashes; daisies; walnuts; cacti; violets and 
buttercups. 

Effective amount: An effective amount of an immunoadhesin of the present 
invention is sufficient to detectably inhibit viral or bacterial infection (as the case may be), 
cytotoxicity or replication; or to reduce the severity or length of infection. 

Human rhinovirus (HRV): A nonenveloped RNA virus representing a subgroup of 
picornavirus, that is a major cause of the common cold in humans. Rhinoviruses are 
described in Rhinoviruses, Reo viruses, and Parvoviruses, pp. 1057-1059, Zinsser 
Microbiology, Joklik et al., eds. Appleton and Lange (1992). 

Immunoadhesin : A complex containing a chimeric receptor protein molecule, and 
optionally containing secretory component, and J chain. 

Immunoglobulin heavy chain: A polypeptide that contains at least a portion of the 
antigen binding domain of an immunoglobulin and at least a portion of a variable region of 
an immunoglobulin heavy chain or at least a portion of a constant region of an 
immunoglobulin heavy chain. Thus, the immunoglobulin derived heavy chain has 
significant regions of amino acid sequence homology with a member of the 
immunoglobulin gene superfamily. For example, the heavy chain in an Fab fragment is an 
immunoglobulin-derived heavy chain. 

Immunoglobulin light chain: A polypeptide that contains at least a portion of the 
antigen binding domain of an immunoglobulin and at least a portion of the variable region 
or at least a portion of a constant region of an immunoglobulin light chain. Thus, the 



16 



WO 03/064992 



PCT/US02/34197 



immunoglobulin-derived light chain has significant regions of amino acid homology with 
a member of the immunoglobulin gene superfamily. 

Immunoglobulin molecule: A protein containing the immunologically-active 
portions of an immunoglobulin heavy chain and immunoglobulin light chain covalently 
coupled together and capable of specifically combining with antigen. 

ICAM-1: Intercellular adhesion molecule- 1. In humans, ICAM-1 functions as the 
receptor for human rhinovirus. 

J chain: A polypeptide that is involved in the polymerization of immunoglobulins 
and transport of polymerized immunoglobulins through epithelial cells. See, The 
Immunoglobulin Helper: The J Chain in Immunoglobulin Genes, at pg. 345, Academic 
Press (1989). J chain is found in pentameric IgM and dimeric IgA and typically attached 
via disulphide bonds. J chain has been studied in both mouse and human. 

Monocotyledonous plants (monocots): Flowering plants whose embryos have one 
cotyledon or seed leaf. Examples of monocots are: lilies; grasses; corn; grains, including 
oats, wheat and barley; orchids; irises; onions and palms. 

Glycosylation: The modification of a protein by oligosaccharides. See, Marshall, 
Ann. Rev. Biochem., 41:673 (1972) and Marshall, Biochem. Soc. Symp., 40:17 (1974) for 
a general review of the polypeptide sequences that function as glycosylation signals. 
These signals are recognized in both mammalian and in plant cells. 

Plant-specific glycosylation: The glycosylation pattern found on plant-expressed 
proteins, which is different from that found in proteins made in mammalian or insect cells. 
Proteins expressed in plants or plant cells have a different pattern of glycosylation than do 
proteins expressed in other types of cells, including mammalian cells and insect cells. 
Detailed studies characterizing plant- specific glycosylation and comparing it with 
glycosylation in other cell types have been performed by Cabanes-Macheteau et al., 
Glycobiology 9(4):365-372 (1999), Lerouge et al., Plant Molecular Biology 38:31-48 
(1998) and Altmann, Glycoconjugate J. 14:643-646 (1997). Plant-specific glycosylation 
generates glycans that have xylose linked |3(1 ,2) to mannose. Neither mammalian nor 
insect glycosylation generate xylose linked 0(1,2) to mannose. Plants do not have a sialic 
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acid linked to the terminus of the glycan, whereas mammalian cells do. In addition, plant- 
specific glycosylation results in a fucose linked a(l,3) to the proximal GlcNAc, while 
glycosylation in mammalian cells results in a fucose linked a(l,6) to the proximal 
GlcNAc. 

5 Secretory component (SC): A component of secretory immunoglobulins that helps 

to protect the immunoglobulin against inactivating agents thereby increasing the biological 
effectiveness of secretory immunoglobulin. The secretory component may be from any 
mammal or rodent including mouse or human. 

sICAM: A naturally-occurring soluble truncated form of ICAM-1 lacking both the 
10 hydrophobic transmembrane domain and the carboxy-terminal cytoplasmic domain of 
ICAM. 

T he articles, patents and patent applications cited in this document are incorporated 
into this document as if set forth in full. 

Although much of the discussion which follows is directed to ICAM-1 
15 immunoadhesin aspects and embodiments, it will be clear to one of ordinary skill that 
other antiviral or antibacterial immunoadhesins can be similarly produced by 
incorporating receptor protein molecules other than ICAM-1. Indeed, examples 10-12 are 
directed to antibacterial embodiments in which the anthrax toxin receptor (ATR) is used 
instead of ICAM-1. 

20 B. Immunoadhesins Containing Chimeric ICAM Molecules 

The present invention provides novel methods for producing immunoadhesin 
molecules containing chimeric receptor proteins, e.g., ICAM-1 receptor proteins. ICAM-1 
immunoadhesins, for example, contain chimeric ICAM-1 molecules made up of a 
rhinovirus receptor protein linked to a portion of an immunoglobulin heavy chain 
25 molecule in association with J chain and secretory component. The chimeric ICAM-1 
molecules of such aspects of the present invention contain two molecules derived from 
different sources: a rhinovirus receptor protein portion and an immunoglobulin chain 
portion. The rhinovirus receptor protein is derived from the intercellular adhesion 
molecule 1 (ICAM-1). The nucleotide sequence for the human rhinovirus receptor ICAM- 
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1 has been determined and characterized by Staunton, et aL, Cell 52:925-933 (1988); 
Greve, et al. Cell 56:839-847 (1989); Greve, et al. J. Virology 65:6015-6023 (1991); 
Staunton, et al., Cell, 61:243-254 (1990) and described in Sequence ID No. 3 and 
GenBank accession no. M24283. 

The ICAM-1 molecule is a membrane protein (SEQ ID NOS: 1 and 2) that has 5 
extracellular domains, a hydrophobic transmembrane domain and a short cytoplasmic 
domain. These features have been described by Casasnovas, et al., Proc. Natl. Acad. Sci. 
U.S.A., 95:4134-4139 (1998) and Staunton, et al, Cell 52:925-933 (1988). Of particular 
use in appropriate aspects of the present invention are the domains of the ICAM-1 
molecule that are responsible for the binding of human rhino viruses which have been 
localized to the N-terminal domains 1 and 2 (Greve, et al., J. Virol., 65:6015-6023 1991, 
and Staunton, et al., Cell, 61:243-245 1990. Such aspects also contemplate(s) rhinovirus 
receptor protein portions which include any combination of extracellular domains 1, 2, 3, 
4, and 5 of the ICAM-1 molecule. In particular preferred embodiments, the rhinovirus 
receptor protein portion includes domains 1 and 2 of the ICAM-1 molecule and in other 
preferred embodiments domains 1, 2, 3, 4 and 5 of the ICAM-1 molecule are present. 

The boundaries of the 5 extracellular domains of ICAM-1 are well known in the art 
and described in Staunton, et al., Cell 52:925-933 (1988). The approximated domain 
boundaries are shown in Table 1 below [SEQ ID NO:2] . 

Table 1 

ICAM-1 Domains Amino Acids 

1 1-88 

2 89-105 

3 106-284 

4 285-385 

5 386-453 

As used in some aspects and embodiments of the present invention, the ICAM-1 
domain 1 is from about residue 1 to about residue 88; domain 2 is from about residue 89 to 
about residue 105; domain 3 is from about residue 106 to about residue 284; domain 4 is 
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from about residue 285 to about 385; and domain 5 is from about residue 386 to 453. One 
of skill in the art will understand that the exact boundaries of these domains may vary. 

The chimeric ICAM-1 molecules preferably contain at least a portion of an IgM or 
IgA heavy chain which allows that immunoglobulin heavy chain to bind to 
5 immunoglobulin J chain and thereby binds to the secretory component. It is contemplated 
that the portion of the chimeric ICAM-1 molecule derived from the immunoglobulin 
heavy chain may be comprised of individual domains selected from the IgA heavy chain 
or the IgM heavy chain or from some other isotype of heavy chain. It is also contemplated 
that an immunoglobulin domain derived from an immunoglobulin heavy chain other than 
10 IgA or IgM or from an immunoglobulin light chain may be molecularly engineered to bind 
immunoglobulin J chain and thus may be used to produce immunoglobulins and 
immunoadhesins of the present invention. 

One skilled in the art will understand that immunoglobulins consist of domains 
which are approximately 100-1 10 amino acid residues. These various domains are well 

15 known in the art and have known boundaries. The removal of a single domain and its 

replacement with a domain of another antibody molecule is easily achieved with modem 
molecular biology. The domains are globular structures which are stabilized by intrachain 
disulfide bonds. This confers a discrete shape and makes the domains a self-contained 
unit that can be replaced or interchanged with other similarly shaped domains. The heavy 

20 chain constant region domains of the immunoglobulins confer various properties known as 
antibody effector functions on a particular molecule containing that domain. Example 
effector functions include complement fixation, placental transfer, binding to staphyloccal 
protein, binding to streptococcal protein G, binding to mononuclear cells, neutrophils or 
mast cells and basophils. The association of particular domains and particular 

25 immunoglobulin isotypes with these effector functions is well known and for example, 
described in Immunology, Roitt et al., Mosby St. Louis, Mo. (1993 3rd Ed.) 

One of skill in the art will be able to identify immunoglobulin heavy chain constant 
region sequences. For example, a number of immunoglobulin DNA and protein sequences 
are available through GenBank. Table 2 shows the GenBank Accession numbers of 
30 immunoglobulin heavy chain genes and the proteins encoded by the genes. The sequences 
listed in Table 2 are shown in Fig. 7. 
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The ICAM-1 immunoadhesins of the present invention may, in addition to the 
chimeric ICAM-1 molecule, contain immunoglobulin light chains, or immunoglobulin J 
chain bound to the immunoglobulin derived heavy chains. In preferred embodiments, the 
immunoadhesin of the present invention comprises two or four chimeric ICAM-1 
molecules and an immunoglobulin J chain bound to at least one of the chimeric ICAM-1 
molecules. The J chain is described and known in the art. See, for example, M. Koshland, 
The Immunoglobulin Helper: The J Chain, in Immunoglobulin Genes, Academic Press, 
London, pg. 345, (1989) and Matsuuchi et al., Proc. Natl. Acad. Sci. U.S.A., 83:456-460 
(1986). The sequence of the immunoglobulin J chain is available on various databases in 
the United States. 

The immunoadhesin may have a secretory component associated with the chimeric 
ICAM-1 molecule. This association may occur by hydrogen bonds, disulfide bonds, 
covalent bonds, ionic interactions or combinations of these various bonds. Typically, 
chimeric ICAM-1 molecules are held together by disulfide bonds between the molecules. 
The interaction of the chimeric ICAM-1 molecules may be non-covalent or disulfide 
bonding. 

The present invention contemplates the use of secretory component from a number 
of different species, including human, rat, rabbit, bovine and the like. The nucleotide 
sequences for these molecules are well known in the art. For example, U.S. Patent 
6,046,037 contains many of the sequences and this patent is incorporated herein by 
reference. The immunoadhesins of the present invention containing the secretory 
component, the chimeric ICAM-1 molecule and J chain are typically bonded together by 
one of the following: hydrogen bonds, disulfide bonds, covalent bonds, ionic interactions 
or combinations of these bonds. 

The present invention also contemplates immunoadhesins which comprise more 
than one receptor protein molecule, e.g., ICAM-1. ICAM-1 immunoadhesins, for example, 
may contain chimeric ICAM-1 molecules that are monomelic units and not disulfide 
bonded to other chimeric ICAM-1 molecules. In preferred embodiments, the 
immunoadhesin does contain chimeric ICAM-1 molecules that are in association with 
other chimeric ICAM-1 molecules to form dimers and other multivalent molecules. 
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Typically the chimeric ICAM-1 molecule is present as a dimer because of the association 
of the immunoglobulin portion of the chimeric molecule. The immunoglobulin portion of 
the chimeric ICAM-1 molecule allows the association of two chimeric ICAM-1 molecules 
to form a dimeric molecule having two active binding portions made up of the rhinovirus 
5 receptor protein portion. In preferred embodiments, dimerization occurs via the disulfide 
bonding regions that normally occur between the immunoglobulin domains as part of a 
naturally-occurring immunoglobulin molecule and the native immunoglobulin protein. 
One of skill in the art will understand that these disulfide bonds that are normally present 
in the native immunoglobulin molecule can be modified, moved and removed while still 
10 maintaining the ability to form a dimer of the chimeric ICAM-1 molecules. 

In other preferred embodiments, the immunoadhesin contains multimeric forms of 
the chimeric ICAM-1 molecule due to the association of J chain with the immunoglobulin 
portion of the chimeric ICAM molecule. The association of J chain with the dimer of two 
chimeric ICAM-1 molecules allows the formation of tetrameric forms of the 

1 5 immunoadhesin. In a preferred embodiment, the immunoglobulin portion of the chimeric 
ICAM-1 molecule is derived from the IgA molecule, and the addition of J chain allows the 
formation of a tetrameric complex containing four chimeric ICAM-1 molecules and four 
binding sites. In other preferred embodiments, the immunoglobulin heavy-chain portion 
of the chimeric molecule is derived from IgM and multivalent complexes containing ten or 

20 twelve molecules may be formed. In other preferred embodiments, in which the chimeric 
ICAM-1 molecule uses a chimeric immunoglobulin heavy-chain, the chimeric ICAM-1 
molecule may form dimers or other higher order multivalent complexes through the 
domains from either IgA or IgM that are responsible for J chain binding. In other chimeric 
immunoglobulin molecules the portions of the immunoglobulin responsible for the 

25 disulfide bonding between the two immunoglobulin heavy-chains and/or the disulfide 
bonding between an immunoglobulin light-chain and heavy-chain may be placed in the 
chimeric immunoglobulin molecule to allow the formation of dimers or other high order 
multivalent complexes. 

Various aspects and embodiments contemplate a chimeric ICAM-1 molecule in 
30 which the immunoglobulin domains comprising the heavy chain are derived from different 
isotypes of either heavy or light chain immunoglobulins. One skilled in the art will 
understand that using molecular techniques, these domains can be substituted for a similar 
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domain and thus produce an immunoglobulin that is a hybrid between two different 
immunoglobulin molecules. These chimeric immunoglobulins allow immunoadhesins 
containing secretory component to be constructed that contain a variety of different and 
desirable properties that are conferred by different immunoglobulin domains. 

5 Also contemplated are chimeric ICAM-1 molecules in which the portion of the 

chimeric molecule derived from immunoglobulin, heavy or light chain may contain less 
than an entire domain derived from a different immunoglobulin molecule. The same 
molecular techniques may be employed to produce such chimeric ICAM-1 molecules. 

In preferred embodiments, the chimeric ICAM-1 molecules contain at least the 
10 CHI, CH2, CH3, domain of mouse or human IgAl, IgA2 or IgM. Other preferred 

embodiments of the present invention contain immunoglobulin domains that include at 
least the Cjil, C^i2, Cji3, or C^i4 domains of IgM. 

Preferred chimeric ICAM-1 molecules contain domains from two different 
isotypes of human immuno globulin. Preferred chimeric ICAM-1 molecules that include 

15 immunoglobulins that contain immunoglobulin domains including at least the CHI, CH2, 
or CH3 of human IgG, IgGl, IgG2, IgG3, IgG4, IgAl, IgA2, IgE, or IgD. Other preferred 
immunoglobulins for use as part of chimeric ICAM-1 molecules include immunoglobulins 
that contain domains from at least the CHI, CH2, CH3, or CH4 domain of IgM or IgE. 
The present invention also contemplates chimeric ICAM-1 molecules that contain 

20 immunoglobulin domains derived from at least two different isotypes of mammalian 

immunoglobulins. Generally, any of the mammalian immunoglobulins can be used in the 
preferred embodiments, such as the following isotypes: any isotype of IgG, any isotype of 
IgA, IgE, IgD or IgM. Chimeric ICAM-1 molecules derived from a species such as 
human, mouse or other mammals are contemplated. In preferred embodiments, the 

25 chimeric ICAM-1 molecule contains the portion of IgA or IgM responsible for the 

association of J chain with the IgA and IgM. Thus, by using a chimeric immunoglobulin 
in the chimeric ICAM-1 molecule, the J chain may associate with a chimeric 
immunoglobulin that is predominantly of an isotype that does not bind J chain or secretory 
component. 

30 The present invention also contemplates chimeric molecules that contain 

immunoglobulin domains derived from two different isotypes of rodent or primate 
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immunoglobulin. The isotypes of rodent or primate immunoglobulin are well known in 
the art. The chimeric molecules of the present invention may contain immunoglobulin 
derived heavy chains that include at least one of the following immunoglobulin domains: 
the CHI, CH2, or CH3 domains of a mouse IgG, IgGl, IgG2a, IgG2b, IgG3, IgA, IgE, or 
5 IgD; the CHI, CH2, CH3 or CH4 domain of mouse IgE or IgM; the CHI domain of a 
human IgG, IgGl, IgG2, IgG3, IgG4, IgAl, IgA2, or IgD; the CHI, CH2, CH3, CH4 
domain of human IgM or IgE; the CHI, CH2, or CH3 domain of an isotype of mammalian 
IgG, an isotype of IgA, IgE, or IgD; the CHI, CH2, CH3 or CH4 domain of a mammalian 
IgE or IgM; the CHI, CH2 , or CH3 domain of an isotype of rodent IgG, IgA, IgE, or IgD; 

10 the CHI, CH2, CH3 or CH4 domain of a rodent IgE or IgM; the CHI, CH2, or CH3 

domain of an isotype of animal IgG, an isotype of IgA, IgE, or IgD; and the CHI, CH2, 
CH3, or CH4 domain of an animal IgE or IgM. The present invention also contemplates 
the replacement or addition of protein domains derived from molecules that are members 
of the immunoglobulin superfamily into the chimeric molecules, e.g., chimeric ICAM-1. 

15 The molecules that belong to the immunoglobulin superfamily have amino acid residue 
sequence and nucleic acid sequence homology to immunoglobulins. The molecules that 
are part of the immunoglobulin superfamily can be identified by amino acid or nucleic 
acid sequence homology. See, for example, p. 361 of Immunoglobulin Genes, Academic 
Press (1989). 

20 In preferred embodiments of the present invention, the immunoadhesin is 

expressed by methods that generate an immunoadhesin having plant-specific 
glycosylation. It is well-known in the art that glycosylation is a major modification of 
proteins in plant cells (Lerouge et al., Plant Molecular Biology 38:31-48, 1998). 
Glycosylation of proteins also occurs in other cell types, including mammalian and insect 

25 cells. The glycosylation process starts in the endoplasmic reticulum by the co- 

translational transfer of a precursor oligosaccharide to specific residues of the nascent 
polypeptide chain. Processing of this oligosaccharide into different types of glycans, 
which differ in the types of residues present and the nature of the linkages between the 
residues, occurs in the secretory pathway as the glycoprotein moves from the endoplasmic 

30 reticulum to its final destination. One of skill in the art will understand that at the end of 
their maturation, proteins expressed in plants or plant cells have a different pattern of 
glycosylation than do proteins expressed in other types of cells, including mammalian 
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cells and insect cells. Detailed studies characterizing plant-specific glycosylation and 
comparing it with glycosylation in other cell types have been performed, for example, in 
studies described by Cabanes-Macheteau et al., Glycobiology 9(4):365-372 (1999), and 
Altmann, Glycoconjugate J. 14:643-646 (1997). These groups and others have shown that 
5 plant-specific glycosylation generates glycans that have xylose linked p(l,2) to mannose, 
but xylose is not linked P(l,2) to mannose as a result of glycosylation in mammalian and 
insect cells. Plant-specific glycosylation results in a fucose linked a(l,3) to the proximal 
GlcNAc, while glycosylation in mammalian cells results in a fucose linked a(l,6) to the 
proximal GlcNAc. Furthermore, plant-specific glycosylation does not result in the 
10 addition of a sialic acid to the terminus of the protein glycan, whereas in glycosylation in 
mammalian cells, sialic acid is added. 

The immunoadhesin of the present invention that is glycosylated in a plant-specific 
manner can contain a chimeric molecule, e.g., chimeric ICAM-1, that includes any 
combination of extracellular domains, e.g., domains 1, 2, 3, 4, and 5 of the ICAM-1 

15 molecule. FIG. 2B shows the amino acid sequence of the chimeric ICAM-l/IgA2 
molecule (SEQ ID NO: 8) of the present invention, that contains all five domains of 
ICAM-1 . The bolded N's represent asparagine residues to which oligosaccharide moieties 
are linked during glycosylation in plant cells, as well as mammalian and insect cells. One 
of skill in the art will know that the glycosylation sites are the tripeptide Asn-X-Ser/Thr 

20 where X can be any amino acid except proline and aspartic acid (Komfeld and Kornfeld, 
Annu Rev Biochem 54:631-664, 1985). It will therefore be known to one of skill in the art 
that which amino acids of the protein having plant-specific glycosylation would depend on 
which domains of ICAM-1 are present. Because the sequence and domain boundaries of 
ICAM-1 are known (see Staunton et. al., Cell 52:925-933, 1988), it would be evident to 

25 one of skill in the art how to determine the plant-specific glycosylation sites on any 
potential combination of any of the five ICAM-1 domains. 

In other preferred ICAM-1 aspects and embodiments of the present invention, the 
immunoadhesin having plant-specific glycosylation and containing a chimeric ICAM-1 
molecule having any combination of ICAM-1 extracellular domains 1, 2, 3, 4 and 5 further 
30 comprises a J chain and secretory component associated with said chimeric ICAM-1 

molecule. As was true with respect to the chimeric ICAM-1 molecule, one of skill in the 
art will be able to identify the sites for plant-specific glycosylation in the J chain and 
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secretory component sequences. The same principle applies for immunoahesins of the 
invention that contain chimeric receptor proteins other than chimeric ICAM-1. 

The present invention contemplates immunoadhesins having plant-specific 
glycosylation, that contain a chimeric molecule, e.g., a chimeric ICAM-1 molecule, in 
5 which the immunoglobulin heavy chain is selected from the group of IgA (SEQ ID 
NOS:15-18 and 52-53), IgAl (SEQ ID NOS:15-16 and 52), IgA2 (SEQ ID NO:17 and 
53), IgGl (SEQ ID NOS: 19-20 and 54), IgG2 (SEQ ID NOS:21-22 and 55), IgG3 (SEQ 
ID NOS:23-24 and 56), IgG4 (SEQ ID NOS:25-26 and 57), IgM (SEQ ID NOS:46-47 and 
61-62), IgD (SEQ ID NOS:27-33, 35-36, 38, 40, and 42), IgE (SEQ ID NOS:44-45 and 

10 59-60), and a chimeric immunoglobulin heavy chain. One of skill in the art will know that 
which of these immunoglobulin heavy chain sequences, or which combination of 
immunoglobulin heavy chain sequences are combined in a chimeric immunoglobulin 
heavy chain, will have an effect on the number and location of glycosylation sites in the 
chimeric molecule of the immunoadhesin. As was true with respect to the chimeric 

15 molecule, one of skill in the art will be able to identify the sites for plant-specific 
glycosylation in the immunoglobulin heavy chain sequences, including the various 
chimeric immunoglobulin heavy chain sequences that can be constructed. 

Also provided herein are immunoadhesin functional derivatives. By "functional 
derivative" is meant a "chemical derivative," "fragment," or "variant," of the polypeptide 

20 or nucleic acid of the invention which retains at least a portion of the function of the 

protein, for example reactivity with an antibody specific for the protein, enzymatic activity 
or binding activity, which permits its utility in accordance with the present invention. It is 
well known in the art that due to the degeneracy of the genetic code numerous different 
nucleic acid sequences can code for the same amino acid sequence. It is also well known 

25 in the art that conservative changes in amino acid can be made to arrive at a protein or 
polypeptide that retains the functionality of the original. In both cases, all permutations 
are intended to be covered by this disclosure. 

The derivatives may also be engineered according to routine methods to include an 
affinity purification tag such that large quantities and/or relatively pure or isolated 
30 quantities of immunoadhesin may be produced. Many different versions of tag exist that 
can be incorporated into one or more components of the immunoadhesin, preferably not 
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destroying the desired binding activity of the immunoadhesin in the absence of tag. Such 
tags can be engineered as expressible encoded nucleic acid sequence fused with nucleic 
acid sequences encoding the immunoadhesins of the invention. The tags may further be 
engineered to be removable, e.g., with a commercially available enzyme. 

5 Further, it is possible to delete codons or to substitute one or more codons with 

codons other than degenerate codons to produce a structurally modified polypeptide, but 
one which has substantially the same utility activity as the polypeptide produced by the 
unmodified nucleic acid molecule. As recognized in the art, the two polypeptides can be 
functionally equivalent, as are the two nucleic acid molecules that give rise to their 
10 production, even though the differences between the nucleic acid molecules are not related 
to the degeneracy of the genetic code. 

Manipulations of this sort, and post-production chemical derivatization may be 
implemented, e.g., to improve stability, solubility, absorption, biological or therapeutic 
effect, and/or biological half-life. Moieties capable of mediating such effects are 

15 disclosed, for example, in Remington's Pharmaceutical Sciences, 18th ed., Mack 

Publishing Co., Easton, PA (1990). A functional derivative intended to be within the 
scope of the present invention is a "variant" polypeptide which either lacks one or more 
amino acids or contains additional or substituted amino acids relative to the native 
polypeptide. The variant may be derived from a naturally occurring complex component 

20 by appropriately modifying the protein DNA coding sequence to add, remove, and/or to 
modify codons for one or more amino acids at one or more sites of the C-terminus, N- 
terminus, and/or within the native sequence. It is understood that such variants having 
added, substituted and/or additional amino acids retain one or more characterizing portions 
of the native protein, as described above. 

25 A functional derivative of a protein with deleted, inserted and/or substituted amino 

acid residues may be prepared using standard techniques well-known to those of ordinary 
skill in the art. For example, the modified components of the functional derivatives may 
be produced using site-directed mutagenesis techniques (as exemplified by Adelman et. 
al., 1983, DNA 2:183) wherein nucleotides in the DNA coding sequence are modified 

30 such that a modified coding sequence is produced, and thereafter expressing this 

recombinant DNA in a prokaryotic or eukaryotic host cell, using techniques such as those 
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described above. Alternatively, proteins with amino acid deletions, insertions and/or 
substitutions may be conveniently prepared by direct chemical synthesis, using methods 
well-known in the art. The functional derivatives of the proteins typically exhibit the same 
qualitative biological activity as the native proteins. 

5 In addition, the immunoadhesins of the invention may be not just modified 

receptor protein/Ig immunoadhesins, but may also embrace other native receptor protein 
family members, isotypes, and/or other homologous amino acid sequences, e.g., human, 
primate, rodent, canine, feline, bovine, avian, etc. Furthermore, the Ig type used in the 
immunoadhesins can vary, e.g., may assume a different Ig family member identity, within 
10 or without a given species. ICAMs and Igs, for example, are diverse and have well-known 
sequences that one of ordinary skill can exploit to create different immunoadhesins having 
more or less different utility in a given organism to undergo treatment. An illustrative, 
nonexhaustive list of examples of molecules having ICAM-1 homology that can be used to 
create other immunoadhesins include those in the following table. 



15 Table 3 



ACCESSION NO. 


ICAM NAME 


SPECIES 


NP 000192 


Intercellular Adhesion Molecule- 1 (CD54) 

[SEQ ID NO:63] 


Homo sapiens 


AAH03097 


Intercellular Adhesion Molecule ICAM-2 

[SEQ ID NO:641 


Homo sapiens 


NP 002153 


Intercellular Adhesion Molecule 3 Precursor 

rSEQ ID NO:65] 


Homo sapiens 


BAB20325 


TCAM-1 

[SEQ ID NO:66] 


Homo sapiens 


NP 003250 


Intercellular Adhesion Molecule 5 (Telencephalin) 

[SEQ ID NO:67] 


Homo sapiens 


NM 007164 


Mucosal Vascular Address in Cell Adhesion Molecule 
(MADCAM1) 

[SEQ ID NO:68] 


Homo sapiens 


NM 001078 


Vascular Cell Adhesion Molecule 1 (VCAM1) 

[SEQ ID NO:69] 


Homo sapiens 


AAA37875 


MALA-2 

[SEQ ID NO:70] 


Mus musculus 


AAA37876 


Intercellular Adhesion Molecule- 1 Precursor 

[SEQ ID NO:71] 


Mus musculus 


AAG30280 


Intracellular Adhesion Molecule 1 

[SEQ ID NO:72] 


Cricetulus griseus 


AAB39264 


Intercellular Adhesion Molecule-3 

[SEQ ID NO:73] 


Bos taurus 


AAF80287 


Intercellular Adhesion Molecule- 1 Precursor 

[SEQ ID NO:74] 


Sus scrofa 
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ACCESSION NO. 


ICAM NAME 


SPECIES 


AAA 18478 


Telecephalin 


fSEQ ID NO:75] 


Oryctolagus cuniculus 


NP 032345 


Intercellular Adhesion Molecule 5, telencephalin 

[SEQ ID NO:76] 


Mus musculus 


BAB41106 


Cell adhesion molecule TCAM-1 


[SEQ ID NO:77] 


Mus musculus 


NP 067705 


Testicular Cell Adhesion Molecule 1 


[SEQ ID NO:78] 


Rattus norvegicus 


AAG35584 


Nectin-Like Protein 1 


[SEQ ID NO:791 


Mus musculus 


AAC18956 


CD22 Protein 


[SEQ ID NO:80] 


Homo sapiens 


AAA35415 


Intercellular Adhesion Molecule 1 


[SEQ ID NO:81] 


Pan troglodytes 


AAA83206 


89 kDa Protein 


[SEQ ID NO:82] 


Mus musculus 


AAA92551 


Intercellular Adhesion Molecule- 1 


[SEQ ID NO:83] 


Canis familiaris 


AAB06749 


Intercellular Adhesion Molecule- 1 


[SEQ ID NO:84] 


Bos taurus 


AAD13617 


Intercellular Adhesion Molecule- 1 Precursor 

[SEQ ID NO:85] 


Ovis aries 


NP 037099 


Intercellular Adhesion Molecule- 1 


[SEQ ID NO: 86] 


Rattus norvegicus 


AAE22202 


ICAM-4 


[SEQ ID NO: 87] 


Rattus norvegicus 


AAA60392 


cell surface glycoprotein 


[SEQ ID NO:88] 


Homo sapiens 


AAF91086 


Nephrin 


[SEQ ID NO:89] 


Rattus norvegicus 


AAF91087 


Nephrin 


fSEQ ID NO:901 


Mus musculus 



Likewise, numerous heavy chain constant regions of different Ig molecules, both in 
humans and other species, are known that can be substituted in for those specific Ig 
regions of the chimeras described herein. 

5 C. Vectors, Cells and Plants Containing Immunoadhesins 

The present invention also contemplates expression and cloning vectors, cells and 
plants containing the immunoadhesins of the present invention. Technology for isolating 
the genes encoding the various portions of the immunoadhesions are well-known to one of 
skill in the art and can be applied to insert the various required genes into expression 
1 0 vectors and cloning vectors such as those vectors can be introduced into cells and into 
transgenic plants. 
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The present invention contemplates a method of assembling an immunoadhesin 
comprising the steps of: introducing into an organism a DNA segment encoding a 
chimeric receptor protein molecule (e.g., an ICAM-1 molecule), immunoglobulin J chain, 
and introducing into the same organism a DNA encoding a secretory component. The 
5 preferred secretory component contains at least a segment of the amino acid residues 1 to 
residue about 606 of the human polyimmunoglobulin receptor (plgR) amino acid residue 
sequence or analogous amino acid residues from other species (Mostov, Ann Dev. Immu. 
12:63-84 1994). 

The present invention contemplates eukaryotic cells, including plant cells, 

10 containing immunoadhesins of the present invention. The present invention also 
contemplates plant cells that contain nucleotide sequences encoding the various 
components of the immunoadhesin of the present invention. One skilled in the art will 
understand that the nucleotide sequences that encode the secretory component protection 
protein and the chimeric receptor protein molecule and J chain will typically be operably 

1 5 linked to a promoter and present as part of an expression vector or cassette. Typically, if 
the eukaryotic cell used is a plant cell then the promoter used will be a promoter that is 
able to operate in a plant cell. After the chimeric receptor protein genes, secretory 
component genes and J chain genes are isolated, they are typically operatively linked to a 
transcriptional promoter in an expression vector. The present invention also contemplates 

20 expression vectors containing a nucleotide sequence encoding a chimeric receptor protein 
molecule which has been operatively linked to a regulatory sequence for expression. 
These expression vectors place the nucleotide sequence to be expressed in a particular cell 
3' of a promoter sequence which causes the nucleotide sequence to be transcribed and 
expressed. The expression vector may also contain various enhancer sequences which 

25 improve the efficiency of this transcription. In addition, such sequences as terminators, 
polyadenylation (poly A) sites and other 3' end processing signals may be included to 
enhance the amount of nucleotide sequence transcribed within a particular cell. 

Expression of the components in the organism of choice can be derived from an 
independently replicating plasmid, or from a permanent component of the chromosome, or 
30 from any piece of DNA which may transiently give rise to transcripts encoding the 
components. Organisms suitable for transformation can be either prokaryotic or 
eukaryotic. Introduction of the components of the complex can be by direct DNA 
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transformation, by biolistic delivery into the organism, or mediated by another organism 
as for example by the action of recombinant Agrobacterium on plant cells. Expression of 
proteins in transgenic organisms usually requires co-introduction of an appropriate 
promoter element and polyadenylation signal. In one embodiment of the invention, the 
5 promoter element potentially results in the constitutive expression of the components in all 
of the cells of a plant. Constitutive expression occurring in most or all of the cells will 
ensure that precursors can occupy the same cellular endomembrane system as might be 
required for assembly to occur. 

Expression vectors compatible with the host cells, preferably those compatible 
10 with plant cells are used to express the genes of the present invention. Typical expression 
vectors useful for expression of genes in plants are well known in the art and include 
vectors derived from the tumor-inducing (Ti) plasmid of Agrobacterium tumefaciens 
described by Rogers et al., Meth. in Enzymol., 153:253-277 (1987). However, several 
other expression vector systems are known to function in plants. See for example, Verma 
15 et al., PCT Publication No. WO87/00551; and Cocking and Davey, Science, 236:1259- 
1262(1987). 

The expression vectors described above contain expression control elements 
including the promoter. The genes to be expressed are operatively linked to the expression 
vector to allow the promoter sequence to direct RNA polymerase binding and synthesis of 

20 the desired polypeptide coding gene. Useful in expressing the genes are promoters which 
are inducible, viral, synthetic, constitutive, and regulated. The choice of which expression 
vector is used and ultimately to which promoter a nucleotide sequence encoding part of 
the immunoadhesin of the present invention is operatively linked depends directly, as is 
well known in the art, on the functional properties desired, e.g. the location and timing of 

25 protein expression, and the host cell to be transformed, these being limitations inherent in 
the art of constructing recombinant DNA molecules. However, an expression vector 
useful in practicing the present invention is at least capable of directing the replication, 
and preferably also the expression of the polypeptide coding gene included in the DNA 
segment to which it is operatively linked. 

30 In preferred embodiments, the expression vector used to express the genes includes 

a selection marker that is effective in a plant cell, preferably a drug resistance selection 
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marker. A preferred drug resistance marker is the gene whose expression results in 
kanamycin resistance, i.e., the chimeric gene containing the nopaline synthase promoter, 
Tn5 neomycin phosphotransferase II and nopaline synthase 3 ' nontranslated region 
described by Rogers et al., in Methods For Plant Molecular Biology, a Weissbach and H. 
5 Weissbach, eds., Academic Press Inc., San Diego, Calif. (1988). A useful plant expression 
vector is commercially available from Pharmacia, Piscataway, N.J. Expression vectors 
and promoters for expressing foreign proteins in plants have been described in U.S. Pat. 
Nos. 5,188,642; 5,349,124; 5,352,605, and 5,034,322 which are hereby incorporated by 
reference. 

10 A variety of methods have been developed to operatively link DNA to vectors via 

complementary cohesive termini. For instance, complementary homopolymer tracks can 
be added to the DNA segment to be inserted into the vector DNA. The vector and DNA 
segment are then joined by hydrogen bonding between the complementary 
homopolymeric tails to form recombinant DNA molecules. Alternatively, synthetic 

1 5 linkers containing one or more restriction endonuclease sites can be used to join the DNA 
segment to the expression vector. The synthetic linkers are attached to blunt-ended DNA 
segments by incubating the blunt-ended DNA segments with a large excess of synthetic 
linker molecules in the presence of an enzyme that is able to catalyze the ligation of blunt- 
ended DNA molecules, such as bacteriophase T4 DNA ligase. Thus, the products of the 

20 reaction are DNA segments carrying synthetic linker sequences at their ends. These DNA 
segments are then cleaved with the appropriate restriction endonuclease and ligated into an 
expression vector that has been cleaved with an enzyme that produces termini compatible 
with those of the synthetic linker. Synthetic linkers containing a variety of restriction 
endonuclease sites are commercially available from a number of sources including New 

25 England BioLabs, Beverly, Mass. 

The nucleotide sequences encoding the secretory component, J chain, and the 
chimeric receptor protein molecules, e.g., ICAM-1, of the present invention are introduced 
into the same plant cell either directly or by introducing each of the components into a 
plant cell and regenerating a plant and cross-hybridizing the various components to 
30 produce the final plant cell containing all the required components. 
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Any method may be used to introduce the nucleotide sequences encoding the 
components of the immunoadhesins of the present invention into a eukaryotic cell. For 
example, methods for introducing genes into plants include Agrobacterium-mediated plant 
transformation, protoplast transformation, gene transfer into pollen, injection into 
5 reproductive organs and injection into immature embryos. Each of these methods has 
distinct advantages and disadvantages. Thus, one particular method of introducing genes 
into a particular eukaryotic cell or plant species may not necessarily be the most effective 
for another eukaryotic cell or plant species. 

Agrobacterium tumefaciens-mediated transfer is a widely applicable system for 

10 introducing genes into plant cells because the DNA can be introduced into whole plant 
tissues, bypassing the need for regeneration of an intact plant from a protoplast. The use 
of Agrobacterium-mediated expression vectors to introduce DNA into plant cells is well 
known in the art. See, for example, the methods described by Fraley et al., Biotechnology, 
3:629 (1985) and Rogers et al., Methods in Enzymology, 153:253-277 (1987). Further, 

15 the integration of the Ti-DNA is a relatively precise process resulting in few 

rearrangements. The region of DNA to be transferred is defined by the border sequences 
and intervening DNA is usually inserted into the plant genome as described by Spielmann 
et al., Mol. Gen. Genet., 205:34 (1986) and Jorgensen et al., Mol. Gen. Genet., 207:471 
(1987). Modern Agrobacterium transformation vectors are capable of replication in 

20 Escherichia coli as well as Agrobacterium, allowing for convenient manipulations as 
described by Klee et al., in Plant DNA Infectious Agents, T. Hohn and J. Schell, eds., 
Springer-Verlag, New York, pp. 179-203 (1985). Further recent technological advances in 
vectors for Agrobacterium-mediated gene transfer have improved the arrangement of 
genes and restriction sites in the vectors to facilitate construction of vectors capable of 

25 expressing various polypeptide coding genes. The vectors described by Rogers et al., 

Methods in Enzymology, 153:253 (1987), have convenient multi-linker regions flanked by 
a promoter and a polyadenylation site for direct expression of inserted polypeptide coding 
genes and are suitable for present purposes. 

In those plant species where Agrobacterium-mediated transformation is efficient, it 
30 is the method of choice because of the facile and defined nature of the gene transfer. 
Agrobacterium-mediated transformation of leaf disks and other tissues appears to be 

34 



WO 03/064992 



PCT/US02/34197 



limited to plant species that Agrobacterium tumefaciens naturally infects. Thus, 
Agrobacterium-mediated transformation is most efficient in dicotyledonous plants. 

Few monocots appear to be natural hosts for Agrobacterium, although transgenic 
plants have been produced in asparagus using Agrobacterium vectors as described by 
5 Bytebier et al., Proc. Natl. Acad. Sci. U.S.A., 84:5345 (1987). Therefore, commercially 
important cereal grains such as rice, corn, and wheat must be transformed using alternative 
methods. Transformation of plant protoplasts can be achieved using methods based on 
calcium phosphate precipitation, polyethylene glycol treatment, electroporation, and 
combinations of these treatments. See, for example, Potrykus et al., Mol. Gen. Genet., 
10 199:183 (1985); Lorz et al., Mol. Gen. Genet, 199:178 (1985); Fromm et al., Nature, 
319:791 (1986); Uchimiya et al., Mol. Gen. Genet., 204:204 (1986); Callis et al., Genes 
and Development, 1:1 183 (1987); and Marcotte et al., Nature, 335:454 (1988). 

Application of these methods to different plant species depends upon the ability to 
regenerate that particular plant species from protoplasts. Illustrative methods for the 
15 regeneration of cereals from protoplasts are described in Fujimura et al., Plant Tissue 

Culture Letters, 2:74 (1985); Toriyama et al., Theor Appl. Genet., 73:16 (1986); Yamada 
et al., Plant Cell Rep., 4:85 (1986); Abdullah et al., Biotechnology, 4:1087 (1986). 

To transform plant species that cannot be successfully regenerated from 
protoplasts, other ways to introduce DNA into intact cells or tissues can be utilized. For 

20 example, regeneration of cereals from immature embryos or explants can be effected as 
described by Vasil, Biotechnology, 6:397 (1988). In addition, "particle gun" or high- 
velocity microprojectile technology can be utilized. Using such technology, DNA is 
carried through the cell wall and into the cytoplasm on the surface of small (0.525 jum) 
metal particles that have been accelerated to speeds of one to several hundred meters per 

25 second as described in Klein et al., Nature, 327:70 (1987); Klein et al., Proc. Natl. Acad. 
Sci. U.S.A., 85:8502 (1988); and McCabe et al., Biotechnology, 6:923 (1988). The metal 
particles penetrate through several layers of cells and thus allow the transformation of cells 
within tissue explants. Metal particles have been used to successfully transform com cells 
and to produce fertile, stably transformed tobacco and soybean plants. Transformation of 

30 tissue explants eliminates the need for passage through a protoplast stage and thus speeds 
the production of transgenic plants. 



35 



WO 03/064992 



PCT/US02/34197 



DNA can also be introduced into plants by direct DNA transfer into pollen as 
described by Zhou et al., Methods in Enzymology, 101 :433 (1983); D. Hess, Intern Rev. 
Cytol., 107:367 (1987); Luo et al., Plant Mol. Biol. Reporter, 6:165 (1988). Expression of 
polypeptide coding genes can be obtained by injection of the DNA into reproductive 
5 organs of a plant as described by Pena et al., Nature, 325:274 (1987). DNA can also be 
injected directly into the cells of immature embryos and the rehydration of desiccated 
embryos as described by Neuhaus et al., Theor. Appl. Genet., 75:30 (1987); and Benbrook 
et al., in Proceedings Bio Expo 1986, Butterworth, Stoneham, Mass., pp. 27-54 (1986). 

The regeneration of plants from either single plant protoplasts or various explants 
10 is well known in the art. See, for example, Methods for Plant Molecular Biology, A. 

Weissbach and H. Weissbach, eds., Academic Press, Inc., San Diego, Calif. (1988). This 
regeneration and growth process includes the steps of selection of transformant cells and 
shoots, rooting the transformant shoots and growth of the plantlets in soil. 

The regeneration of plants containing the foreign gene introduced by 
15 Agrobacterium tumefaciens from leaf explants can be achieved as described by Horsch et 
al., Science, 227:1229-1231 (1985). In this procedure, transformants are grown in the 
presence of a selection agent and in a medium that induces the regeneration of shoots in 
the plant species being transformed as described by Fraley et al., Proc. Natl. Acad. Sci. 
U.S.A., 80:4803 (1983). This procedure typically produces shoots within two to four 
20 weeks and these transformant shoots are then transferred to an appropriate root-inducing 
medium containing the selective agent and an antibiotic to prevent bacterial growth. 
Transformant shoots that rooted in the presence of the selective agent to form plantlets are 
then transplanted to soil to allow the production of roots. These procedures will vary 
depending upon the particular plant species employed, such variations being well known 
25 in the art. 

The immunoadhesins of the present invention may be produced in any plant cell 
including plant cells derived from plants that are dicotyledonous or monocotyledonous, 
solanaceous, alfalfa, legumes, or tobacco. 

Transgenic plants of the present invention can be produced from any sexually 
30 crossable plant species that can be transformed using any method known to those skilled 
in the art. Useful plant species are dicotyledons including tobacco, tomato, the legumes, 
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alfalfa, oaks, and maples; monocotyledons including grasses, corn, grains, oats, wheat, and 
barley; and lower plants including gymnosperms, conifers, horsetails, club mosses, 
liverworts, hornworts, mosses, algaes, gametophytes, sporophytes or pteridophytes. 

The present invention also contemplates expressing the immunoadhesins within 
5 eukaryotic cells including mammalian cells. One of skill in the art will understand the 
various systems available for expression of the immunoadhesin in mammalian cells and 
can readily modify those systems to express the immunoadhesins and chimeric protein 
receptor molecules, e.g., ICAM-1 molecules, in those cells. In preferred ICAM 
embodiments, the chimeric ICAM-1, J chain and secretory component molecules of the 
10 present invention are placed in a vector pCDM8 which has been previously described by 
Aruffo, et al., Proc. Natl. Acad. Sci. U.S.A., 84:8573-8577 (1987). The use of the PCDM8 
construct is by no means unique and numerous other eukaryotic expression systems 
systems are available that do not utilize the cog cell expression system and that may be 
used with the chimeric ICAM-1 and other molecules of the immunoadhesin. 

15 D. Compositions Containing Immunoadhesins 

The present invention also contemplates compositions containing an 
immunoadhesin of the present invention together with plant macromolecules or material. 
Typically these plant macromolecules or plant materials are derived from any plant useful 
in the present invention. The plant macromolecules are present together with an 

20 immunoadhesin of the present invention for example, in a plant cell, in an extract of a 

plant cell, or in a plant. Typical plant macromolecules associated with the immunoadhesin 
of the present invention in a composition are ribulose bisphosphate carboxylase, light 
harvesting complex pigments (LHCP), secondary metabolites or chlorophyll. The 
compositions of the present invention have plant material or plant macromolecules in a 

25 concentration of between 0.01% and 99% mass excluding water. Other compositions 
include compositions having the immunoadhesins of the present invention present at a 
concentration of between 1% and 99% mass excluding water. Other compositions include 
immunoadhesins at a concentration of 50% to 90% mass excluding water. 

The compositions of the present invention may contain plant macromolecules at a 
30 concentration of between 0.1% and 5% mass excluding water. Typically the mass present 
in the composition will consist of plant macromolecules and immunoadhesins of the 
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present invention. When the immunoadhesins of the present invention are present at a 
higher or lower concentration the concentration of plant macromolecules present in the 
composition will vary inversely. In other embodiments the composition of plant 
macromolecules are present in a concentration of between 0.12% and 1% mass excluding 
water. 

The present invention contemplates a composition of matter comprising all or part 
of the following: a chimeric protein receptor molecule (e.g., an ICAM-1 molecule), a J 
chain or a secretory component. These components form a complex and are associated as 
was previously described. Typically, the composition also contains molecules derived 
from a plant. This composition may also be obtained after an extraction process yielding 
functional immunoadhesin and plant-derived molecules. 

The extraction method comprises the steps of applying a force to a plant containing 
the complex whereby the apoplastic compartment of the plant is ruptured releasing said 
complex. The force involves shearing as the primary method of releasing the apoplastic 
liquid. 

The whole plant or plant extract contains an admixture of immunoadhesin and 
various other macromolecules of the plant. Among the macromolecules contained in the 
admixture is ribulose bisphosphate carboxylase (RuBisCo) or fragments of RuBisCo. 
Another macromolecule is LHCP. Another molecule is chlorophyll. 

Other useful methods for preparing compositions containing immunoadhesins 
include extraction with various solvents and application of vacuum to the plant material. 
The compositions of the present invention may contain plant macromolecules in a 
concentration of between about 0.1% and 5% mass excluding water. 

The present invention also contemplates therapeutic compositions which may be 
used in the treatment of a patient or animal. Administration of the therapeutic composition 
can be before or after extraction from the plant or other transgenic organism. Once 
extracted the immunoadhesins may also be further purified by conventional techniques 
such as size exclusion, ion exchange, or affinity chromatography. Plant molecules may be 
co-administered with the complex. 
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The present invention also contemplates that the relative proportion of plant- 
derived molecules and animal-derived molecules can vary. Quantities of specific plant 
proteins, such as RuBisCo or chlorophyll may be as little as 0.01% of the mass or as much 
as 99.9% of the mass of the extract, excluding water. 

5 The present invention also contemplates the direct use of the therapeutic plant 

extract containing immunoadhesins without any further purification of the specific 
therapeutic component. Administration may be by topical application, oral ingestion, 
nasal spray or any other method appropriate for delivering the antibody to the mucosal 
target pathogen. 

10 E. Pharmaceutical Compositions, Formulations, And Routes Of 

Administration 

The immunoadhesins described herein can be administered to a patient, preferably 
in the form of a suitable pharmaceutical composition. Such composition may include 
components in addition to, or in lieu of, those described above. The composition 

1 5 preferably exhibits either or both of a therapeutic and prophylactic property when 

administered. The preparation of such compositions can be done according to routine 
methodologies in the art, and may assume any of a variety of forms, e.g., liquid solutions, 
suspensions or emulsifications, and solid forms suitable for inclusion in a liquid prior to 
ingestion. Techniques for the formulation and administration of polypeptides and proteins 

20 may be found in Remington's Pharmaceutical Sciences, Mack Publishing Co., Easton, PA, 
latest edition. Using these procedures, one of ordinary skill can utilize the 
immunoadhesins of the invention to achieve success without undue experimentation. 

1. Administration Routes 

Suitable routes of administration for the invention include, e.g., oral, nasal, 
25 inhalation, intraocular, phanyngeal, bronchial, transmucosal, or intestinal administration. 
Alternatively, one may administer the compound in a local manner, e.g., via injection or 
other application of the compound to a preferred site of action. 

2. Formulations 
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The pharmaceutical compositions of the present invention may be manufactured in 
a manner that is itself known, e.g., by means of conventional mixing, dissolving, 
granulating, dragee-making, levigating, emulsifying, encapsulating, entrapping or 
lyophilizing processes. One or more physiologically acceptable carriers comprising 
5 excipients and/or other auxiliaries can be used to facilitate processing of the active 

compounds into pharmaceutical preparations. Proper formulation is dependent upon the 
particular route of administration chosen. 

For injection, the agents of the invention may be formulated in aqueous solutions, 
preferably in physiologically compatible buffers such as Hanks 's solution, Ringer's 
10 solution, or physiological saline buffer. For transmucosal administration, penetrants 

appropriate to the barrier to be permeated are used in the formulation. Such penetrants are 
generally known in the art. 

For oral administration, the compounds can be formulated readily by combining 
the active compounds with pharmaceutically acceptable carriers well known in the art. 

15 Such carriers enable the compounds of the invention to be formulated as tablets, pills, 

dragees, capsules, liquids, gels, syrups, slurries, suspensions and the like, for oral ingestion 
by a patient to be treated. Suitable carriers include excipients such as, e.g., fillers such as 
sugars, including lactose, sucrose, mannitol, and/or sorbitol; cellulose preparations such 
as, e.g., maize starch, wheat starch, rice starch, potato starch, gelatin, gum tragacanth, 

20 methyl cellulose, hydroxypropylmethyl- cellulose, sodium carboxymethylcellulose, and/or 
polyvinylpyrrolidone (PVP). If desired, disintegrating agents may be added, such as the 
cross-linked polyvinyl pyrrolidone, agar, or alginic acid or a salt thereof such as sodium 
alginate. 

Dragee cores are provided with suitable coatings. For this purpose, concentrated 
25 sugar solutions may be used, which may optionally contain gum arabic, talc, polyvinyl 

pyrrolidone, carbopol gel, polyethylene glycol, and/or titanium dioxide, lacquer solutions, 
and suitable organic solvents or solvent mixtures. Dyestuffs or pigments may be added to 
the tablets or dragee coatings for identification or to characterize different combinations of 
active compound doses. 

30 Pharmaceutical preparations which can be used orally include push-fit capsules 

made of gelatin, as well as soft, sealed capsules made of gelatin and a plasticizer, such as 
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glycerol or sorbitol. The push-fit capsules can contain the active ingredients in admixture 
with filler such as lactose, binders such as starches, and/or lubricants such as talc or 
magnesium stearate and, optionally, stabilizers. In soft capsules, the active compounds 
may be dissolved or suspended in suitable liquids, such as fatty oils, liquid paraffin, or 
5 liquid polyethylene glycols. In addition, stabilizers may be added. All formulations for 
oral administration should be in dosages suitable for such administration. 

For buccal administration, the compositions may take the form of tablets or 
lozenges formulated in conventional manner. 

For administration by inhalation, the compounds for use according to the present 
10 invention may be conveniently delivered in the form of an aerosol spray presentation from 
pressurized packs or a nebuliser, with the use of a suitable propellant, e.g., 
dichlorodifluoromethane, trichlorofluoromethane, dichlorotetrafluoroethane, carbon 
dioxide or other suitable gas. In the case of a pressurized aerosol the dosage unit may be 
determined by providing a valve to deliver a metered amount. Capsules and cartridges of 
15 e.g. gelatin for use in an inhaler or insufflator may be formulated containing a powder mix 
of the compound and a suitable powder base such as lactose or starch. 

Alternatively, the active ingredient may be in powder form for constitution with a 
suitable vehicle, e.g., sterile pyrogen-free water, before use. 

In addition, the compounds may also be formulated as a depot preparation. Such 
20 long acting formulations may be administered by implantation (for example 

subcutaneously or intramuscularly) or by intramuscular injection. Thus, for example, the 
compounds may be formulated with suitable polymeric or hydrophobic materials (for 
example as an emulsion in an acceptable oil) or ion exchange resins, or as sparingly 
soluble derivatives, for example, as a sparingly soluble salt. 

25 Additionally, the compounds may be delivered using a sustained-release system, 

such as semipermeable matrices of solid hydrophobic polymers containing the therapeutic 
agent. Various sustained-release materials have been established and are well known by 
those skilled in the art. Sustained-release capsules may, depending on their chemical 
nature, release the compounds for a few weeks up to over 100 days. Depending on the 
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chemical nature and the biological stability of the therapeutic reagent, additional strategies 
for protein stabilization may be employed. 

The pharmaceutical compositions also may comprise suitable solid or gel phase 
carriers or excipients. Examples of such carriers or excipients include but are not limited 
to calcium carbonate, calcium phosphate, various sugars, starches, cellulose derivatives, 
gelatin, and polymers such as polyethylene glycols. 

Pharmaceutically compatible salts may be formed with many acids, including but 
not limited to hydrochloric, sulfuric, acetic, lactic, tartaric, malic, succinic, citric, etc. 
Salts tend to be more soluble in aqueous or other protonic solvents that are the 
corresponding free base forms. In solutions, manipulation of pH is also routinely 
employed for optimizing desired properties. 

3. Determining Effective Dosages and Dosage Regimens 

Pharmaceutical compositions suitable for use in the present invention include 
compositions where the active ingredients are contained in an amount effective to achieve 
an intended purpose, e.g., a therapeutic and/or prophylactic use. A pharmaceutically 
effective amount means an amount of compound effective to prevent, alleviate or 
ameliorate symptoms of disease or prolong the survival of the subject being treated. 
Determination of a pharmaceutically effective amount is well within the capability of 
those skilled in the art, and will typically assume an amount of between about 0.5 
Hg/kg/day and about 500g/kg/day, with individual dosages typically comprising between 
about 1 nanogram and several grams of immunoadhesin. 

For any compound used in the methods of the invention, the therapeutically 
effective dose can be estimated initially from cell culture assays. For example, varying 
dosages can be administered to different animals or cell cultures and compared for effect. 
In this way, one can identify a desired concentration range, and prepare and administer 
such amount accordingly. Optimization is routine for one of ordinary skill in the art. 

The person of skill, in addition to considering pharmaceutical efficacy, also 
considers toxicity according to standard pharmaceutical procedures in cell cultures or 
experimental animals, e.g., for determining the LD50 (the dose lethal to 50% of the 
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population) and the ED 50 (the dose therapeutically effective in 50% of the population). 
The dose ratio between toxic and therapeutic effects is the therapeutic index and it can be 
expressed as the ratio between LD50 and ED50. Compounds which exhibit high therapeutic 
indices are preferred. The data obtained from these cell culture assays and animal studies 
5 can be used in formulating a range of dosage for use in humans. The dosage of such 
compounds lies preferably within a range of concentrations that include the ED50 with 
little or no toxicity. The dosage may vary within this range depending upon the dosage 
form employed and the route of administration utilized. The exact formulation, route of 
administration and dosage can be chosen by the individual physician in view of the 
10 patient's condition. (See e.g., Fingl et al., 1975, in "The Pharmacological Basis of 
Therapeutics," Ch. 1 p.l). 

Dosage amount and frequency may be adjusted to provide mucosal levels of 
immunadhesin sufficient to maintain or provide a pharmaceutical effect, e.g., therapeutic 
and/or prophylactic. The minimal effective concentration (MEC) will vary for each 
15 immunadhesin and immunoadhesin formulation, but can be estimated from in vitro and/or 
in vivo data. Dosages necessary to achieve MEC will depend on individual characteristics 
and route of administration. However, assays as described herein can be used to 
determine mucosal concentrations, which can then be further optimized in amount and 
precise formulation. 

20 Dosage intervals can also be determined using MEC value. Compounds can be 

administered using a regimen which maintains mucosal levels above the MEC for 10-90% 
of the time, 30-90% of the time, or, most preferably, 50-90% of the time. 

The compositions may, if desired, be presented in a pack or dispenser device which 
may contain one or more unit dosage forms containing the active ingredient. The pack 

25 may for example comprise metal or plastic foil, such as a blister pack. The pack or 

dispenser device may be accompanied by instructions for administration. The pack or 
dispenser may also be accompanied with a notice associated with the container in form 
prescribed by a governmental agency regulating the manufacture, use, or sale of 
pharmaceuticals, which notice is reflective of approval by the agency of the form of the 

30 immunoadhesin for human or veterinary administration. Such notice, for example, may be 
the labeling approved by the U.S. Food and Drug Administration for prescription drugs, or 
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the approved product insert. Compositions comprising a compound of the invention 
formulated in a compatible pharmaceutical carrier may also be prepared, placed in an 
appropriate container, and labeled for treatment of an indicated condition, e.g. treatment or 
prophylaxis of a disease mediated by host organism/patient protein receptor molecules. 

5 F. Methods of Treatment and Prevention of ICAM-mediated Afflictions 

A patient in need of therapeutic and/or prophylactic immunoadhesin chimeras of 
the invention, e.g., to counter rhinovirus infection and/or symptoms such as occur with 
colds, can be administered a pharmaceutically effective amount of desired 
immunoadhesin, preferably as part of a pharmaceutical composition determined, 

10 produced, and administered as described above. These formulations and delivery 

modalities can vary widely. Described following are preliminary procedures that can be 
used to deduce effective amounts and toxicity, and which can then be conveniently used to 
determine treatment and prophylaxis parameters and regimens, both in humans and other 
animals. These procedures are illustrative only and are not intended to be limiting of the 

15 invention. Further, these procedures are routine for one of ordinary skill in the art. 

1. Ability of the Immunoadhesin to Reduce Rhinovirus Infectivity 
in Humans: Dose Escalation Tolerance Study 

Immunoadhesins of the invention maybe tested, e.g., using randomized controlled 
trials to determine the effect of administration, e.g., intranasal, of immunoadhesin on 

20 infection. Other administration routes can be used. Various assays exist that can be used 
to monitor effect, e.g., IL-8 response assays assays that evaluate illness symptoms, e.g., 
cold symptoms caused by rhinovirus infection. These studies can evaluate the extent to 
which an immunoadhesin taken by a patient subjects can prevent or treat rhinovirus 
infection. For example, healthy or unhealthy subjects can be administered the 

25 immunoadhesin and evalutated over a time course, e.g., in tandem with rhinovirus 
inoculation and/or illness progression. The clinical protocols used may be based on 
protocols previously used in evaluation of a recombinant soluble ICAM-1 molecule for 
efficacy against rhinovirus infection, or modifications thereto (Turner, et. al., JAMA 
281:1797-804, 1999). 
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Male and female subjects of any species, age, health, or disease state can be 
evaluated The subjects may exhibit a serum neutralizing antibody titer in advance of 
study, which titer may fluctuate in response to infection and immunoadhesin 
administration. 

5 The immunoadhesin of the present invention may be formulated as a buffered 

saline with varying amounts of immunoadhesin within and administered at various 
intervals to a patient. Single ascending dose and multiple ascending dose studies can be 
used to evaluate the safety of the immunoadhesin. In each case, one or more subjects may 
be evaluated at each dosage level, some receiving the immunoadhesin, and one or more 
10 optionally receiving placebo. In either study, multiple dosage levels may be evaluated. 
Dosage levels can vary, but are typically in the nanogram to gram range. 

Dosages may be administered over seconds, minutes, hours, weeks, and months, 
and evaluated for toxicity and/or pharmaceutical effect. 

Safety and toxicity may be assessed, e.g., by visual examination of the nasal 
1 5 mucosa for signs of irritation or inflammation. Blood safety evaluations can also be 
employed according to routine methods and using sensitive assays such as ELIS A to 
determine various blood components, including circulating immunoadhesin and rhinovirus 
quantities. Naval lavage testing may similarly be done according to routine 
methodologies. 

20 Routine statistical analyses and calculations may be employed to determine 

efficacy and toxicity predicted over time courses for single patients and/or for populations 
of patient-recipients.. 

Challenge studies as well known in the art can be used to demonstrate that 
treatment protects against clinical colds and/or reduces cold symptoms after viral 
25 challenge, and using commercially available starting materials such virus, cells, and 
animals. See, e.g., Gwaltney, et. al., Prog. Med. Virol. 39:256-263, 1992. 

The following examples illustrate various aspects and embodiments of the 
disclosed invention. These examples in no way limit the scope of the claimed invention. 
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EXAMPLES 

1. Construction of ICAM-1 Immunoadhesin Expression Cassettes 

A cassette encoding ICAM-1 extracellular domains Dl through D5 was prepared 
by PCR cloning. Specifically, a fragment containing all five extracellular Ig-like domains 
5 of ICAM-1 was amplified from plasmid pCDICl-5D/IgA (Martin, et al. J. Virol. 67:3561- 
8, 1993) using the following oligonucleotide primers: 

5'- 

TCTGTTCCCAGGAACTAGTTTGGCACAGACATCTGTGTCCCCCTCAAAAGTC-3' 
(SEQ ID NO: 6) 

1 0 5 '-C ATACCGGGGACTAGTCACATTCACGGTC ACCTCGCGG-3 ' (SEQ ED 

NO: 7) 

These two primers were designed to introduce Spel sites at the 5' and 3' ends of 
the PCR fragment (underlined nucleotides). PCR was performed with Pfu polymerase 
(Stratagene) to reduce accumulation of errors. The PCR fragment was cloned into the 
15 vector PCRScript (Stratagene), and sequenced before fusing to the human IgA2 cassettes 
(with and without SEKDEL [SEQ ID NO:4] at the carboxy-terminus). 

Constructs for the expression in plants of human J chain and secretory component, 
as well as a human IgA2 heavy chain, were developed. A heavy chain expression cassette 
vector was made and called pSSpHuA2 (See FIG. 1). It contains sequence encoding a 

20 bean legumin signal peptide (Baumlein et al., Nucleic Acids Res. 14 (6), 2707-2720, 

1986). The sequence of bean legumin is provided as GenBank Accession No. X03677, 
and the sequence of the bean legumin signal peptide is SEQ ID NO: 10 (also see Fig. 8) 
and the IgA2m(2) constant region with Spel and SacI sites in between, and the SuperMas 
promoter for driving the expression of a signal peptide and the constant regions of the 

25 human IgA2m(2) heavy-chain. 

The amplified DNAs encoding the first five domains of human ICAM-1, and the 
Fc region of human IgA2m(2) were fused in a plant-expression cassette to make a 
chimeric ICAM-1 molecule expression construct, shown in FIG. 2 A. This was done by 
cloning the fragment encoding the five extracellular domains of ICAM-1 into vector 
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pSSPHuA2 to generate pSSPICAMHuA2. The convenient restriction sites (5' Spel and 3' 
Spe I) allowed the amplified fragment to be inserted between the signal peptide and the 
Cccl domain. In the resulting construct, expression of the chimeric ICAM-1 molecule is 
under the control of the constitutive promoter "superMAS" (Ni et. al., 1995) and the nos 
5 3 ' terminator region. 

The resulting chimeric ICAM-1 molecule construct contains no variable region. 
Upon translation of the mRNA, signal peptide cleavage is predicted to deposit the ICAM- 
1 -heavy chain fusion into the plant cell's endoplasmic reticulum (ER). DNA encoding an 
ER retention signal (RSEKDEL, SEQ ED NO: 5) was appended to the 3' end of the heavy- 

10 chain in order to boost the expression level of the construct. The amino acid sequence 

SEKDEL (SEQ ID NO: 4) is the consensus signal sequence for retention of proteins in the 
endoplasmic reticulum in plant cells. This sequence has been shown to enhance 
accumulation levels of antibodies in plants (Schouten et al., Plant Molecular Biology 
30:781-793,1996). The amino acid sequence of the chimeric ICAM-1 molecule construct 

15 is shown in FIG. 2B. The DNA sequence and translational frame of the construct was 
verified before it was used for particle bombardment. 

It has been shown recently that assembly of J chain with IgA takes place in the 
Golgi apparatus (Yoo et al., J. Biol. Chem. 274:33771-33777, 1999), and so constructions 
of heavy chain without SEKDEL (SEQ ID NO: 4) have been made as well. The ICAM-1 
20 fragment was cloned into an expression cassette containing the IgA2m(2) constant region 
without SEKDEL (SEQ ID NO: 4). 

2. Expression of Assembled ICAM-l Immunoadhesin in Plants 

A. Immunoadhesin Expression Vectors 

The plasmid pSSPICAMHuA2 [SEQ ID NO:9 and FIG. 8 A] is 6313 bp in length. 

25 Nucleotides 49-1 165 represent the Superpromoter (Ni et al., Plant Journal 7:661-676, 

1995). Nucleotides 1 166-3662 comprise a sequence encoding a human ICAM-1 /human 
IgA2m(2) constant hybrid with linker sequences. A consensus Kozak sequence (Kozak, 
Cell 44(2):283-92, 1986) is included (nt 1186-1192) to enhance translation initiation, as 
well as the signal peptide from V. faba legumin (nt 1 189-1257; Baumlein et al., Nucleic 

30 Acids Reg. 14(6):2707-2720 (1986). The sequence of the human IgA2m(2) constant 
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region (nt 3663-3633) has been previously published (Chintalacharuvu, et aL, J. 1mm. 152: 
5299-5304, 1994). A sequence encoding the endoplasmic reticulum retention signal 
SEKDEL [SEQ ID NO:4] is appended to the end of the heavy Chain (nt 3634-3654). 
Nucleotides 3663-3933 derive from the nopaline synthase 3' end (transcription 
5 termination and polyadenlyation signal; Depicker et aL, 1982). The remainder of the 
plasmid derives from the vector pSP72 (Promega). 

The plasmid pSHuJ (FIG. 8C) is 4283 bp in length. Nucleotides 14-1 136 represent 
the Superpromoter (Ni et aL, Plant Journal 7:661-676, 1995) and nucleotides 1 137-1648 
are shown in FIG. 8 [SEQ ID NO:l 1] and comprise a sequence encoding the human J 

10 Chain including the native signal peptide (Max and Korsmeyer, J 1mm. 152:5299-5304, 

1985) along with linker sequences. A consensus Kozak sequence (Kozak, Cell 44(2):283- 
92, 1986) is included (nt 1162-1168) to enhance translation initiation. Nucleotides 1649- 
1902 derive from the nopaline synthase 3' end (transcription termination and 
polyadenlyation signal; Depicker et aL, J Mol Appl Genet l(6):561-73, 1982). The 

15 remainder of the plasmid derives from the vector pSP72 (Promega). 

The plasmid pSHuSC (FIG. 8D) is 5650 bp in length. Nucleotides 13-1 136 are 
derived from the Superpromoter (Ni et aL, Plant Journal 7:661-676, 1995), and nucleotides 
1 137-2981 comprise a sequence encoding the human Secretory Component including the 
native signal peptide (Krajci, et aL, Biochem. and Biophys. Res. Comm 158:783, 1994) 

20 along with linker sequences [SEQ ID NO: 12]. A consensus Kozak sequence (Kozak, Cell 
44(2):283-92, 1986) is included (nt 1 151-1 157) to enhance translation initiation. 
Nucleotides 2982-3236 derive from the nopaline synthase 3' end, providing a transcription 
termination and polyadenlyation signal, described in Depicker et aL, J Mol Appl Genet 
l(6):561-73 (1982). The remainder of the plasmid derives from the vector pSP72 

25 (Promega). 

The plasmid pBMSP-1 [SEQ ID NO: 13 and FIG. 8E] is derived from pGPTV- 
KAN. Becker et aL, in Plant Molecular Biology 20, 1 195-1 197, (1992), describe new 
plant binary vectors with selectable markers located proximal to the left T-DNA border, 
and the sequences outside of the left and right borders. Nucleotides 18-187 of pBMSP-1 
30 represent the right T-DNA border, and nucleotides 181 1-775 represent the superMAS 
promoter. Nucleotides 2393-2663 represent the NOS promoter (Depicker et aL, J Mol 
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Appl Genet l(6):561-73, 1982), nucleotides 2698-3492 encode the NPTII gene (conferring 
resistance to kanamycin), and nucleotides 351 1-3733 are the polyadenylation signal from 
A. tumefaciens gene 7 (Gielen et al., Embo J 3:835-46, 1984). Nucleotides 1768-976 
encode the NPTII gene, and nucleotides 4317-4464 represent the left T-DNA border. 

5 The plasmid pBMSP-lspJSC [SEQ ID NO:14 and FIG. 8F] is a derivative of 

pBMSP, containing both J and SC under control of superpromoter. In this plasmid, 
nucleotides 1-149 represent the left T-DNA border. Nucleotides 955-733 are the 
polyadenylation signal from A. tumefaciens gene, nucleotides 1768-976 encode the NPTII 
gene (conferring resistance to kanamycin), and nucleotides 2073-1803 represent the NOS 

10 promoter. Nucleotides 2635-3768 represent the superMAS promoter, nucleotides 3774- 
5595 encode the Human Secretory component, and nucleotides 5603-5857 represent the 
NOS polyadenylation signal. Nucleotides 5880-6991 represent the superMAS promoter, 
nucleotides 7007-7490 encode the Human Joining Chain, and nucleotides 7504-7757 
represent the NOS polyadenylation signal. Nucleotides 7886-8057 represent the right T- 

15 DNA border. 

The plasmid pGPTV-HPT, encoding the enzyme conferring hygromycin 
resistance, is available commercially from the Max-Planck-Institut fur 
Ziichtungsforschung (Germany). It is described by Becker in Plant Molecular Biology 20, 
1195-1197(1992). 

20 B. Plant Transformation and ICAM-1 Immunoadhesin Expression 

in Plants 

The expression cassettes described above were used to produce the assembled 
immunoadhesin in plants. Plasmids pSSPICAMHuA2, pSHuJ, pSHuSC and pBMSP- 
lwere co-bombarded into tobacco leaf tissue (N. tabacum cultivar Xanthi) and 
25 transformed microcalli were selected on nutrient agar in the presence of kanamycin. 
Individual microcalli, indicative of independent transformation events, were dissected 
from the parent tissue and propagated on nutrient agar with kanamycin. 

The callus tissues were screened for transgene expression. Callus #7132 was 
shown to express a chimeric ICAM-1 immunoadhesin and J chain by immunob lotting and 
30 PCR (data not shown). This callus did not possess DNA encoding the SC. The callus 
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grew well in culture and, upon accumulation of sufficient mass, #7132 was bombarded 
again, this time with two of the plasmids described above, PBMSP-1 SpJSC, containing 
expression cassettes for both the J chain and SC and pGPTV-HPT, containing an 
expression cassette for the hpt I gene which confers hygromycin resistance. After a period 
5 of selection and growth on nutrient agar, several independent transformants were 

identified, by immunoblotting, that expressed the chimeric ICAM-1 molecule, the J chain 
and SC in several states of assembly. 

FIG. 3 illustrates the expression of the chimeric ICAM-1 molecule in 
independently transformed tobacco calli. FIG. 3 A shows immunoblots of non-reducing 

10 SDS-polyacrylamide gels on which samples containing different transformed tobacco calli 
(C) and aqueous extracts (Aq) were run and probed for the presence of human ICAM. The 
solubility of the immunoadhesin assured us that extraction could be easily performed, and 
the similarity of signals leads us to believe in the reproducibility of expression. FIG. 3B 
shows immunoblots of nonreducing SDS-polyacrylamide gels containing various 

15 fractions of partially purified immunoadhesin from callus Rhi 107-1 1 . The blots were 
probed with antibodies against human ICAM (-ICAM), human IgA heavy chain (~a), 
human secretory component (~SC) and human J chain (~J). Secondary, enzyme- 
conjugated antibodies were employed as necessary to label immuno-positive bands with 
alkaline phosphatase. The specificity of immuno-blotting was further verified by a failure 

20 to detect immuno-positive bands in extracts of non-expressing calli (not shown). The 
expected MW for a dimerized chimeric ICAM-1 molecule, without glycosylation, is 
173,318; this form is likely present in the band migrating just below the 250kD marker 
since it is immuno-positive for ICAM-1 and heavy-chain. This band is also immuno- 
positive for SC (total expected MW of -248 kD) but not for J chain which is somewhat 

25 unexpected given the canonical pathway for SIgA assembly, which involves 2 cell types 
(in mammalian) and requires the presence of J chain prior to assembly of SC. A 
tetrameric immunoadhesin, containing a single molecule of J chain and a single molecule 
of SC, has an expected MW of -440 kD, prior to glycosylation. Several species with 
molecular weights well in excess of 200 kD, immuno-positive with all four probes, are 

30 readily apparent. 

Bombardment with DNA-coated microprojectiles is used to produce stable 
transformants in both plants and animals (reviewed by Sanford et al., Meth. Enz. 217:483- 
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509,1993). Particle-mediated transformation with the vectors encoding the 
immunoadhesin of the present invention was performed using the PDS-1000/He particle 
acceleration device, manufactured by Bio-Rad. The PDS-1000/He particle acceleration 
device system uses Helium pressure to accelerate DNA-coated microparticles toward 
5 target cells. The physical nature of the technique makes it extremely versatile and easy to 
use. We have successfully transformed tobacco with all four components of a secretory 
IgA simultaneously. 

The basic biolistic procedure was performed as follows: A stock suspension of 
microprojectiles was prepared by mixing 60 mg of particles in 1 ml of absolute ethanol. 

10 This suspension was vortexed and 25-50 |il was removed and added to a sterile 

microcentrifuge tube. After microcentrifuging for 30 seconds the ethanol was removed 
and the pellet resuspended in 1 ml sterile water and centrifuged for 5 minutes. The water 
was then removed and the pellet resuspended in 25-50 jal of DNA solution containing a 
mixture of plasmid DNAs, usually, but not always in equimolar amounts. The amount of 

15 plasmid added varied between 0.5 ng and 1 jig per preparation. The following were added 
sequentially: 220 pi of sterile water, 250 pi of 2.5M CaC12, and 50 jxl of 0.1M spermidine. 
This mixture was vortexed for at least 1 0 min and then centrifuged for 5 min. The 
supernatant was removed and the DNA/microprojectile precipitated in 600 pi of absolute 
ethanol, mixed and centrifuged Imin. The ethanol was removed and the pellet 

20 resuspended in 36 pi of ethanol. Ten pi of the suspension was applied as evenly as 

possible onto the center of a macrocarrier sheet made of Kapton (DuPont) and the ethanol 
was evaporated. The macrocarrier sheet and a rupture disk were placed in the unit. A 
petri dish containing surface-sterilized tobacco leaves was placed below the stopping 
screen. The chamber was evacuated to 28-29mm Hg and the target was bombarded once. 

25 The protocol has been optimized for tobacco, but is optimized for other plants as well by 
varying parameters such as He pressure, quantity of coated particles, distance between the 
macrocarrier and the stopping screen and flying distance from the stopping screen to the 
tissue. 

Expression cassettes for chimeric ICAM-1 molecules were also assembled in 
30 binary vectors for use in Agrobacterium-mediated transformation. An Agrobacterium 
binary vector designed for expression of both human J chain and human secretory 
component, as well as kanamycin resistance, was introduced into A. tumefaciens strain 
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LBA4404. The chimeric ICAM/IgA molecule in another binary vector was also used to 
transform LBA4404. Overnight cultures of both strains were used for simultaneous "co- 
cultivation" with leaf pieces of tobacco, according to a standard protocol (Horsch et ah, 
Science 227:1229-1231, 1985). 

5 A standard protocol for regeneration of both bombarded and Agrobacterium- 

transformed tobacco leaf disks was used (Horsch et al., Science 227:1229-1231, 1985). 
Because transformed plants, regenerated from bombarded tissue, frequently undergo gene- 
silencing upon maturation, transgenic tobacco plants were prepared via Agrobacterium 
mediated transformation, which gives a higher yield of expressing, mature plants. 

10 3. Purification of Assembled ICAM-1 Immunoadhesin 

The immunoadhesin expressed according to Examples 3 was purified. Calli were 
grown in large amounts to facilitate the development of extraction procedures. A partial 
purification schedule provided a stable concentrate, available in a variety of buffer 
conditions, for investigation of subsequent chromatographic techniques for the further 

15 purification of the immunoadhesin (See FIG. 3). Calli were extracted in a juicer, which 
crushes tissue between two stainless-steel gears, while bathed in a buffer containing 
sodium citrate (0.6 M, pH 7.4) and urea (final concentration of 2 M). The juice (~1 ml/g 
fresh weight) was precipitated, after coarse filtration through cheesecloth, with 0.67 
volumes of saturated ammonium sulfate. A green pellet was collected after centrifugation 

20 and thoroughly extracted, in a small volume of 50 mM sodium citrate (pH 6.6), with a 
Dounce homogenizer. After additional centrifugation, a clear brown supernatant was 
collected and partially purified, during buffer exchange in a de-salting mode, by passage 
through a Sephadex G-100 column. The desalting/buffer exchange step has allowed 
preparation of a partially purified concentrate (-0.2 ml/ g fresh weight callus) in a 

25 desirable buffer; the G-100 column was eluted with 0.25 X phosphate buffered saline. 
This eluate appeared to be stable for at least 10 days at 2-8°C. 

4. The ICAM-lJmmunoadhesin Inhibits Human Rhinovirus Infectivity 

The infectivity of cells by human rhinovirus was demonstrated to be inhibited by 
the immunoadhesin prepared according to Example 3. Callus extract prepared according 
30 to Example 3 successfully competed for binding of an anti-ICAM monoclonal antibody to 
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soluble ICAM-1. FIG. 4 shows the data from an enzyme-linked immunosorbent assay 
(ELISA). For the assay, 96-well plates were coated with 0.25 |ig soluble ICAM-1 /ml. 
The squares represent the increasing concentrations of sIC AM and the circles represent the 
increasing amounts of callus extract (sterile filtered fraction from G-100) used to compete 
5 with the adhered ICAM for a constant amount of a mouse (anti-human ICAM) antibody. 
After washing the wells, adherent mouse antibody was detected with an anti-mouse 
antibody conjugated to horseradish peroxidase. Adherent enzyme activity was measured 
at 490 nm, with ortho-phenylene diamine as a substrate. The data (squares, sICAM; 
circles, Extract) are well described by sigmoids of the form OD490 = y =y0 + a/[l+e A -{(x- 

10 x0)/b}], where a = y max, yO = y min, b = the slope of the rapidly changing portion of the 
curve and xO = the value of x at the 50% response level. Relative to soluble ICAM-1, the 
immunoadhesin extract tested here contains the equivalent of -250 jag ICAM/ml; this is an 
overestimate due to expected avidity effects of the dimeric and tetrameric assemblies of 
the ICAM-1 -heavy-chain fusions. Thus, this ELISA demonstrated that the 

15 immunoadhesin competes with soluble ICAM-1 for binding to an anti-ICAM mAb. 

The competitive ELISA allows for quantitative assessment of the recovery of 
activity by comparing the normalized amounts of various fractions required to give a 50% 
response. Upon purification, the titer of a immunoadhesin preparation may be expressed 
as a reciprocal dilution, or the number of milliliters to which a milligram of 
20 immunoadhesin must be diluted in order to give a 50 % response. This ELISA will 
facilitate the development of a purification process for the immunoadhesin. 

A cytopathic effect assay (CPE) demonstrated the specific ability of the partially 
purified immunoadhesin to inhibit the infectivity of human cells by human rhino virus 
(FIG. 5). Rhinovirus serotype HRV-39 was pre-incubated with human ICAM-1, an 
25 ICAM/IgA fusion (gift of Dr. Tim Springer), or with extracts from calli either expressing 
our ICAM-l/SIgA immunoadhesin or another, different, antibody before plating each of 
the mixtures with HeLa S3 cells at 33°C. After 3 days, viable cells were fixed and stained 
with a methanolic solution of Crystal Violet; the optical density at 570 nm provides a 
proportional measure of cell viability. 

30 Two extracts derived from Rhi 107-1 1, containing the immunoadhesin, clearly 

inhibited the virus' ability to infect and kill HeLa S3 cells (FIG. 5 A, right side-up and 
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upside-down triangles). Because the extracts were only partially purified, we also assayed 
a similarly prepared extract that contained a human IgA2m(2) directed against 
Doxorubicin, a chemotherapeutic agent. That extract, containing a similar 
immunoglobulin with an unrelated binding specificity, was unable to inhibit the infectivity 
5 of the rhino virus and demonstrates that expression of the ICAM-1 -heavy-chain fusion 
confers specificity to the inhibition. The CPE assay demonstrated, as expected, the 
differential ability of souluble ICAM-1 and an (ICl-5/IgA; Martin, et al., 1993) to inhibit 
viral infectivity (FIG. 5B). The insert in Figure 5B is the scale expansion in the range of 
the IC 50 for soluble ICAM-1 (1.35 ^ig/ml) and for the ICI-5/IgA (0.12 [ig/ml; 1 1.3 fold 
10 less). 

5. Production and Purification of 1 mmu noadhcsins for Clinical and 
Toxicological Studies 

Production of sufficient immunoadhesin for the proposed clinical and toxicological 
needs is performed by making transgenic tobacco plants. The transgenic plants which 

15 express the immunoadhesin (without an ER retention signal) are generated by 

Agrobacterium-mediated transformation. The absence of an ER retention signal is 
anticipated to enhance assembly since the nascent SIgA is processed through the entire 
Golgi apparatus, including, in particular, the trans-Golgi, where SC is covalently linked to 
dlgA as suggested by pulse-chase experiments (Chintalacharuvu & Morrison, 

20 Immunotechnology 4: 1 65- 1 74, 1 999). Because Agrobacterium-mediated transformation 
is much more likely to generate plants with consistent levels of transgene expression, it is 
likely that progeny of these plants will be used for the production of clinical grade 
immunoadhesin. 

In order to maximize expression levels, and create a true-breeding line, it is 
25 desirable to create homozygous plants. The highest producing plants (generation TO) can 
self-fertilize in the greenhouse before seed is collected. One quarter of the Tl plants are 
expected to be homozygous. These are grown in the greenhouse and seed samples from 
several plants are separately germinated on medium containing kanamycin. All the 
progeny (T2) from homozygous positive plants are expected to be green. Some of the 
30 progeny of heterozygous plants are expected to be white or yellowish. Homozygosity is 
confirmed by back-crossing to wild-type and immunoblotting extracts of the progeny. 
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Harvesting and processing may be continuously meshed during a production 
campaign, especially since multiple harvests may be obtained from a single planting, i.e. 
plants cut to soil level for one harvest are regrown for subsequent harvests. In developing 
a sense of scale for the production of immunoadhesin it is necessary to decide on the 
5 required amount of finished immunoadhesin, account for expression levels (mg 

immunoadhesin present/ kg fresh weight tobacco), know the growth rate of the plants and 
the expected weight of the average plant, and the overall yield of the purification schedule 
(set at 20%). Setting the overall need at 3 g of finished immunoadhesin requires preparing 
for 4 harvests, each with an expected yield of 1 g of finished immunoadhesin. 

10 Given these targets and parameters, the necessary number of plants and hence the 

space requirements for plant growth is determined. FIG. 6 shows an evaluation of the 
production necessities for making 1 gram of finished Immunoadhesin. In this diagram, the 
number of plants needed for 1 g of immunoadhesin, at 20% yield, at expected levels of 
expression and plant weight is illustrated. At different levels of immunoadhesin 

15 expression (mg/kg fresh weight) and overall recovery (set at 20%), the weight of each 
plant, and so the total number of plants, may be determined for a specified production 
target (1 g/harvest) within a window (dotted square) of reasonable possibilities. The 
number of required plants decreases, inversely, with the number of specified growth and 
re-growth periods. The expected biomass production, a function of time and growth 

20 conditions, influences the time to harvest and the time between harvests. These growth 
periods can be adjusted to the realities of the purification schedule by staggering planting 
and harvesting dates. For example, 1 g of finished immunoadhesin from plants with a 
reasonable expression level, of 100 mg of immunoadhesin/kg fresh weight, require 250 
plants when harvested at a weight of 200 g/plant (-80 days post germination). At this 

25 scale, these plants require about 10 m 2 of growing space and are harvested twice over 150 
days. 

Processing 50+ kg of biomass at a time requires several moderately large-scale 
operations which all have counter-parts in the food-processing industry. These include 
bulk materials handling, size reduction, juicing and filtration. A Vincent Press and a 
30 Durco filtration system are used to efficiently process these quantities. The juicing step 
employs a proven and simple buffer of sodium citrate and urea. These components buffer 
the extract, help prevent the oxidation of phenolics and their association with proteins 
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(Gegenheimer, Methods in Enzymology 182:174-193, 1990; Loomis, Methods in 
Enzymology, 31:528-544, 1974; Van Sumere, et al., The Chemistry and Biochemistry of 
Plant Proteins, 1975.) and ensure the solubility of the immunoadhesin during a subsequent 
acid precipitation. 

5 Filtration of acid-insoluble lipid and protein (-90% of the total) is followed by 

tangential flow ultrafiltration to concentrate the immunoadhesin and to remove small 
proteins, especially phenolics. Diafiltration enhances the removal of small molecules and 
exchanges the buffer in preparation for short-term storage and subsequent 
chromatography. Either SP-Sepharose (binding at pH 5.0 or below) or Q-Sepharose 

10 (binding at pH 5.5 or above) are among the ion-exchanges that can be used for filtering 
immunoadhesin. They are readily available, scalable, robust and have high capacities. In 
particular, they are available for expanded-bed formats, which reduce the stringency of 
prior filtration steps. Cation-exchange chromatography, which can be more selective than 
anion-exchange chromatography, is used first. The immunoadhesin is purified from the 

1 5 several species of protein potentially present, to the point where at least 95% of the protein 
is in the form of ICAM-l/IgA, ICAM-l/dlgA or ICAM-l/SIgA, as the presence of di- and 
tetra-valent ICAM-1 domains are critical for potent anti -viral activity. Purified 
immunoadhesin is then tested for acceptable levels of endotoxin, alkaloids such as nicotine 
and for bio-burden. In addition, potency levels (defined by ELISA and CPE assays), 

20 protein concentration, pH and appearance are monitored. Subsequently, the stability of the 
clinical lots of immunoadhesin is determined, to ensure that patients receive fully potent 
immunoadhesin. Even partially purified extracts have been found to be stable for 10 days 
when refrigerated. The titer and potency of clinically formulated immunoadhesin (in 
phosphate-buffered saline), when stored at -20°C, 2-8°C, and at 37°C, over a period of 3 to 

25 6 months, is also tested. 

6. The Immunoadhesins Have Plant-Specific Glycosylation 

The immunoadhesins produced are analyzed to determine the pattern of 
glycosylation present. Cabanes-Macheteau et al.,Glycobiology 9(4):365-372 (1999), 
demonstrated the presence of several glycosyl moieties, typical of plants, on a plant- 
30 expressed antibody construct. Their methods are used to demonstrate that the 

immunoadhesins produced according to Example 1 , 2 and 3 have a plant-specific 
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glycosylation pattern. We anticipate that this diversity will also be a source of variability 
for immunoadhesin. Since crude extracts have been shown to have anti-viral activity in 
vitro (data not shown), glycosylation, as such, does not appear to affect potency. N-linked 
glycosylation (FIG. 2 shows that there are fifteen potential sites on the chimeric ICAM-1 
5 molecule alone) probably contributes to the diversity of bands seen in immuno-blots. 

Immunoadhesin preparations are digested with N-Glycosidase A, before blotting, showing 
that the difference in banding patterns collapse into fewer, discrete bands. In this way, 
glyco forms are initially characterized with reducing and non-reducing polyacrylamide 
gels. In addition, digested and mock-digested fractions are tested in the CPE assay and 
10 competition ELISA, demonstrating the effect of N-linked glycosylation on potency and 
titer in vitro. 

7. The ICAM-1 Immunoadhesin Inactivates Human Rhinovirus 

The immunoadhesin prepared according to Examples 1, 2 and 3 is assayed for its 
ability and to inactivate HRV by binding to the virus, blocking virus entry, and inducing 
15 the formation of empty virus capsids. To measure binding of the immunoadhesin to HRV, 
the immunoadhesin is incubated with [ 3 H]leucine-labeled HRV-39 for 30 min and then 
added to HeLa cells for 1 hr. After washing, cells and bound virus are detached with 
Triton X-100 and [ 3 H] measured in a scintillation counter. 

Inactivation of HRV-39 by incubation with the immunoadhesin is compared with 
20 HRV inactivation by sICAM-1 . HRV-39 is not directly inactivated to a significant extent 
(<0.5 log 10 reduction in infectivity) by incubation with monomelic sICAM-1, while 
incubation with ICl-5D/IgA reduced infectivity approximately 1.0 loglO (Arruda, et al., 
Antimicrob. Agents Chemother. 36:1 186-1 191, 1992; Crump, et al., Antimicrob. Agents 
Chemother. 38:1425-7, 1994). In order to test the ability of the immunoadhesin to 
25 inactivate HRV-39, 106 50% tissue culture infective doses (TCID50) of HRV-39 are 

incubated in medium containing a concentration of sICAM-1 or immunoadhesin equal to 
ten times the IC50 of each molecule for that virus, or in plain medium, for 1 hr at 33°C on 
a rocker platform. Each virus-immunoadhesin or virus-medium mixture are then diluted 
serially in ten- fold dilutions, and the titer determined on HeLa cells in 96-well plates. 

30 The effect of the immunoadhesin on HRV attachment to host cells is tested by 

inoculating HeLa cells with HRV-39 at a MOI of 0.3 in the presence or absence of the 



57 



WO 03/064992 



PCT/US02/34197 



immunoadhesin. Absorbance proceeds for one hour at 4°C, the cells are washed, and 
media is replaced plus or minus the immunoadhesin. Cells are incubated for ten hours at 
33°C (to allow one round of replication), and virus are harvested by freeze/thawing the 
cells. The virus is titered on HeLa cells. 

5 ICAM-IgA (ICl-5D/IgA) is more efficient than Sicam-1 at inducing 

conformational changes in HRV, leading to the formation of empty, non-infectious viral 
particles (Martin, et al. J. Virol. 67:3561-8, 1993). To examine the ability of the 
immunoadhesin produced according to Examples 1, 2 and 3 to induce conformational 
changes in HRV, causing release of viral RNA, purified immunoadhesin is incubated with 

10 [ 3 H]leucine-labeled HRV-39 for 30 min and then the virus is overlayed onto a 5 to 30% 
sucrose gradient. Following centrifugation for 90 min at 40,000 rpm, fractions are 
collected, [ 3 H] measured, and fractions assessed for infectivity. (Intact HRV sediments at 
149S on a sucrose gradient while empty capsids lacking RNA sediments at 75 S (Martin, et 
al. J. Virol. 67:3561-8, 1993)). Due to its increased valence, we expect the ICAM/SIgA 

1 5 immunoadhesin is more efficient at inducing empty non-infectious particles than ICAM- 
IgA. 

The inhibitory effect of purified immunoadhesin on a panel of both major and 
minor (that do not use ICAM-1 as a receptor) HRV serotypes will be examined using the 
CPE assay. The ability of ICAM-1 to inhibit HRV infection varies among viral isolates. 
20 It has been shown (Crump, et al., Antimicrob. Agents Chemother. 38:1425-7, 1994) that 
the EC50 for sICAM-1 varies from 0.6 |ig/ml to >32 ng/ml when tested on a panel of HRV 
major receptor serotypes assay using HeLa cells. Our panel includes nine major serotypes 
(HRV-3, -13, -14, -16, -23, -39, -68, -73, and -80) and the minor receptor serotype HRV- 
1A. 

25 8, Clinical Studies Demonstrating the Ability of the ICAM-1 

Immunoadhesin to Reduce Infectivity in Humans: Dose Escalation 
Tolerance Study 

The immunoadhesin of the present invention is tested in two randomized 
controlled trials to determine the effect of intranasal administration of the immunoadhesin 
30 on infection, IL-8 response, and illness in experimental rhinovirus colds. These two 
studies evaluate the immunoadhesin taken by subjects before or after rhinovirus 
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inoculation. The clinical protocols used here are based on protocols previously used by in 
evaluation of a recombinant soluble ICAM-1 molecule for efficacy against rhino virus 
infection (Turner, et aL, JAMA 281:1797-804, 1999). 

A. Subjects 

5 Subjects are recruited from university communities at the University of Virginia, 

Charlottesville. Subjects are required to be in good health, non-smokers, and between the 
ages of 18 and 60 years. Subjects are excluded if they have a history of allergic disease or 
nonallergic rhinitis, abnormal nasal anatomy or mucosa, or a respiratory tract infection in 
the previous 2 weeks. Pregnant or lactating women or women not taking medically 
10 approved birth control are also excluded. In the experimental virus challenge study (Phase 
I/II, see below), subjects are required to be susceptible to the study virus as evidenced by a 
serum neutralizing antibody titer of 1 :4 or less to the virus, determined within 90 days of 
the start of the trial. 

B. Study Medication 

15 The immunoadhesin of the present invention is formulated as a phosphate-buffered 

saline (PBS) spray solution containing 2.6 mg/ml. The placebo consists of PBS and is 
identical in appearance to the active preparation. The solutions are administered using a 
medication bottle equipped with a metered nasal spray pump. The pump delivers 70 ju.1 of 
solution containing 1 83 ^g of the immunoadhesin with each spray. The medication is 

20 administered as two sprays per nostril, six times daily (at 3-hour intervals) for a total daily 
dose of 4.4 mg. This is the same dose, in mg protein/day, as was used for soluble ICAM-1 
in the tremacamra study infection (Turner, et al., JAMA 281 : 1797-804, 1999). A mole of 
the immunoadhesin has about twice the mass as a mole of sICAM-1. However, given the 
differences in in vitro activity between sICAM-1 and ICAM/IgA fusions, the 

25 immunoadhesin is many fold more effective on a molar basis than sICAM-1 . Thus, this 
amount is a conservative calculation of what is necessary. This amount is used, except in 
the event that the dose escalation study reveals problems at this dose. 

C. Study Design 
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Single ascending dose and multiple ascending dose studies are used to evaluate the 
safety of the immunoadhesin. In each case, three subjects are evaluated at each dosage 
level, two receiving the immunoadhesin and one receiving placebo. In the single 
ascending dose study, four dosage levels are evaluated. The lowest individual dose is half 
5 the anticipated dose to be used in the challenge study, and the highest individual dose is 
twice the anticipated challenge study dose. The dosage levels are as follows: one spray in 
each nostril (366 (ig total), two sprays in each nostril (732 jug total), three sprays in each 
nostril (1098 jig total), four sprays in each nostril (1464 \xg total). 

The same dosage levels are used in the multiple ascending dose study. Subjects 
10 receive doses every three hours (six times per day) for five days. In both studies subjects 
are evaluated at each dosage level, staggering the start of each subsequent level until it is 
clear that there is no acute toxicity at the previous level. All subjects return for a single 
dose 21 days after the first dose, and then for a follow-up at six weeks (for determination 
of serum antibody against the immunoadhesin). 

15 A separate group of twelve subjects is given one dose of two sprays in each nostril 

(732 ^ig total), and nasal lavage is done at 1, 2, 4, 8 and 16 hours (two subjects at each 
time point). Washings are assayed at Panorama Research by ELISA for the 
immunoadhesin in order to calculate its in vivo half-life. The total amount of the 
immunoadhesin to be used in the dose escalation and half-life determination studies (on a 

20 total of 28 subjects) will be approximately 270 mg. 

D. Safety Evaluations 

In addition to routine adverse event recording, the safety of the immunoadhesin is 
assessed in three ways. First, prior to the first dose and after the last dose the investigators 
perform a visual examination of the nasal mucosa, in particular looking for signs of 

25 irritation or inflammation. Any visible changes are noted. Second, standard blood safety 
evaluations are done on samples collected prior to treatment and after the last dose on 
study days 1, 4, and 8 (and 21 in the multiple ascending dose study). Third, serum 
samples are saved, frozen, and used to determine if the immunoadhesin is able to pass 
through the nasal mucosa into the blood. This is accomplished in two ways. First, the 

30 presence the immunoadhesin in serum samples is measured by ELISA. In this assay, anti- 
human IgA antibodies adsorbed to microtiter plates capture any the immunoadhesin in the 



60 



WO 03/064992 



PCT/US02/34197 



serum, which are detected by an anti-ICAM antibody. The sensitivity of the assay is 
determined using normal human serum samples spiked with known concentrations of the 
immunoadhesin. Alternatively, anti-ICAM antibodies can be adsorbed to plates to capture 
the immunoadhesin in the serum, that would be detected by anti-IgA. Second, the 
5 presence of an immune response to the immunoadhesin is assayed with an ELISA method 
that uses the immunoadhesin adsorbed to microtiter plates. Any anti-immunoadhesin 
antibodies in the serum bind, and are detected with anti-human IgG or anti-human IgM. 
Pre-treatment and post-treatment serum samples are compared, and any change in titer is 
considered evidence of uptake of the immunoadhesin. If there is any positive evidence of 
10 anti-immunoadhesin antibodies, additional assays will be done to distinguish between anti- 
ICAM- 1 and anti-IgA activity. 

Patients are screened for the development of an allergic reaction to the 
immunoadhesin. (In previous studies, there were no episodes of adverse reactions with 
soluble ICAM applied topically in the nose or plantibodies applied topically in the oral 
15 cavity.) Individuals exhibiting symptoms of nasal allergy are tested for anti- 

immunoadhesin-specific IgE antibodies in nasal lavage fluids using a sensitive two-step 
ELISA (R&D Systems). 

E. Statistical Analysis. 

The sample size for these studies is based on previous studies using the rhinovirus 
20 challenge model. The sample size planned for the protection studies should be adequate to 
detect a reduction in the incidence of clinical colds from 75% in the placebo groups to 
25% in the active treatment groups at 1 -sided levels of a = .05 and 1-p = .80. In addition, 
the sample size should be adequate to detect a change in the total symptom score of 5 units 
assuming an SD of 5.8 units. 

25 9. Clinical Studies Demonstrating the Ability of the Immunoadhesin to 

Reduce Infectivity in Humans: Challenge Studies 

Challenge studies are used to demonstrate that treatment with the immunoadhesin 
of the present invention protect against clinical colds or reduce cold symptoms after viral 
challenge. 
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A. Challenge Virus. 

The challenge virus used for this study is rhinovirus 39 (HRV-39). Rhinovirus 
type 39 is a major group of rhinovirus that requires ICAM-1 for attachment to cells. The 
challenge virus pool is safety- tested according to consensus guidelines (Gwaltney, et al., 
5 Prog. Med. Virol. 39:256-263, 1992). All subjects are inoculated with approximately 200 
median tissue culture infective dose (TCID50). The virus are administered as drops in two 
inocula of 250 jxl per nostril given approximately 15 minutes apart while the subjects are 
supine. 
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Note: In both s tudies on days 1-5, doses are given at hours 0 , 3, 6, 9, 12, and 15 
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B. Study Design 

Two randomized rhinovirus challenge studies are performed (see Table 4). The 
same formulation of the immunoadhesin of the present invention is evaluated in pre- 
15 inoculation and post-inoculation studies. In both studies, medication is administered as six 
doses each day for five days. Subjects are randomly assigned to receive either the 
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immunoadhesin or matching placebo at the time of enrollment into each study. The study 
is blinded and all clinical trial personnel, subjects, and employees of Panorama Research 
remain blinded until all data are collected. 

In the pre-inoculation study, medications are started four hours (two doses) prior to 
5 viral challenge. The virus challenge is administered one hour after the second dose of the 
immunoadhesin (or placebo) and the four remaining doses of study medication for the first 
day are given as scheduled. In this study eighteen subjects receive the active treatment 
and eighteen subjects receive placebo. 

In the post-inoculation study, medications begin 24 hours after virus challenge. 

10 This timepoint was chosen because it has been used in other studies of protection from 

virus challenge, and because cold symptoms are clearly present (Harris & Gwaltney, Clin. 
Infect. Dis. 23:1287-90, 1996). Virus challenge in this study is administered in the 
morning of study day 0 approximately 24 hours prior to the first dose of study medication 
on the morning of study day 1. In this study, 36 subjects receive the active treatment and 

15 18 subj ects receive placebo . 

Subjects are isolated in individual hotel rooms from study day 0 (the day of virus 
challenge) to study day 6. On each of these days a symptom score and a nasal lavage for 
virus isolation are done in the morning prior to the first dose of medication and a second 
symptom score is done each evening. On study day 6, subjects are released from isolation 
20 but continue to record symptom scores each evening through day 14. The subjects return 
to the study site on study day 21, when a final serum sample for detection of anti- 
immunoadhesin antibodies will be collected. The total amount of immunoadhesin to be 
used in the two virus challenge studies (on a total of 54 subjects) is approximately 1200 
mg. 

25 C. Viral Isolation 

Virus shedding is detected by virus isolation in cell culture. Nasal wash specimens 
are collected by instillation of 5 ml of 0.9% saline into each nostril. This wash is then 
expelled into a plastic cup and kept chilled for one to two hours until it is processed for 
viral cultures. Immunoadhesin is removed from the specimens by treatment with anti- 
30 ICAM-1 antibody adsorbed to an agarose support (Affi-Gel 10, Bio-Rad Laboratories, 
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Hercules, CA). A portion of each processed specimen is stored at -80 °C, and another 
portion is inoculated into two tubes of HeLa-1 cells, a HeLa cell line enriched for the 
production of ICAM-1 Arruda, et al., J. Clin. Microb. 34:1277-1279, 1996). Rhinovirus 
are identified by the development of typical cytopathic effect. Subjects with a positive 
5 viral culture on any of the postchallenge study days are considered infected. Viral titers in 
the specimens stored at -80 °C are determined by culturing serial ten-fold dilutions in 
microtiter plates of HeLa-1 cells. 

Antibody to the challenge virus are detected by serum neutralizing titers done 
using standard methods Gwaltney, et al, Diagnostic Procedures for Viral Rickettsial and 
10 Chlamydial Infections, p. 579-614, American Public Health Association). Serum 

specimens for antibody testing are collected during screening, immediately prior to virus 
challenge (acute), and again 21 days later (convalescent). Subjects with at least a four- 
fold rise in antibody titer to the challenge virus when the convalescent serum sample is 
compared with the acute serum sample are considered infected. 

15 D. Evaluation of Illness Severity 

Illness severity is assessed as previously described (Turner, et al., JAMA 
281:1797-804, 1999). Symptom scores are recorded prior to virus challenge (baseline) 
and twice each day at approximately twelve-hour intervals for the next 6 days. On study 
days 7 through 14 each subject records his/her symptom score once per day in the evening. 

20 At each evaluation, subjects are asked to judge the maximum severity of the following 
eight symptoms in the interval since the last symptom evaluation: sneezing, rhinorrhea, 
nasal obstruction, sore throat, cough, headache, malaise, and chilliness. Each symptom is 
assigned a severity score of 0 to 3 corresponding to a report of symptom severity of 
absent, mild, moderate, or severe. If symptoms are present at baseline, the baseline 

25 symptom score will be subtracted from the reported symptom score. The higher of the 
two daily evaluations are taken as the daily symptom score for each symptom. The daily 
symptom scores for the eight individual symptoms are summed to yield the total daily 
symptom score. The total daily symptom scores for the first 5 days after virus challenge 
(study days 1-5) are summed and on the evening of study day 5, all subjects are asked, 

30 "Do you feel you have had a cold?" Subjects who had a total symptom score of at least 6 
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and either at least three days of rhinorrhea or the subjective impression that they had a 
cold are defined as having a clinical cold. 

The weight of expelled nasal secretions is determined on days 1-7 by providing all 
subjects with packets of preweighed nasal tissues. After the tissues are used they are 
5 stored in an airtight plastic bag. Each morning the used tissues, together with any unused 
tissues from the original packet, are collected and weighed. 

E. IL-8 Assay. 

Recent studies have suggested that the host inflammatory response, particularly 
interleukin 8 (IL-8), may play a role in the pathogenesis of common cold symptoms due to 
10 rhinovirus infection. Concentrations of IL-8 in nasal lavage are determined with a 
commercially available ELISA (R&D Systems, Minneapolis, Minn) as previously 
described (Turner, et al., JAMA 281 : 1797-804, 1999). 

F. Safety Evaluations 

The same evaluations are done in the challenge study as in the dose escalation 
1 5 study described in Example 8. 

G. Statistical Analysis 

Statistical analysis is performed similarly as to that described for the dose 
escalation study described in Example 8. 

The foregoing examples and discussion, while predominantly addressed to ICAM- 
20 1 immunoadhesins, can be readily adapted by one of skill to achieve and implement the 
use of other types of immunoadhesins active against other types or subtypes of virus and 
bacterial pathogens. The following examples illustrate anti-bacterial immunoadhesin 
embodiments making use of the anthrax toxin receptor (ATR) as receptor protein. 

10. Construction of ATR Immunoadhesin Expression Cassettes 

25 A cassette encoding a portion of the extracellular domains of human anthrax toxin 

receptor (ATR) is prepared by PCR cloning. Specifically, a fragment of 523 bp, encoding 
amino acids 44-216 (the so-called von Willebrand factor type A domain) is amplified from 
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plasmid ATR (Bradley et al., 2001), or from plasmid TEM8 (St Croix et al., 2000) using 
the following oligonucleotide primers: 

5 ' -GACCTGTACTTCA - 3 ' 

(SEQ ID NO: 91) 

5 

5 ' -GAGCTCAAAATTGAGTGGATGATGCCTTGCAGAG-3 ' 
(SEQ ID NO: 92) 

The second primer (SEQ ID NO: 92) is designed to introduce a Sac I site at 
the 3' end of the coding region of the ATR extracellular domain (solid underline). PCR is 
10 performed with Pfu polymerase (Stratagene) to reduce accumulation of errors. A second 
fragment of 124 bp, which includes a 5' untranslated region and a plant signal peptide, is 
amplified from plasmid 5ATG-TOPO#4 (which is a PCR clone of a plant-optimized 5' 
untranslated region and signal peptide in the Invitrogen cloning vector pCR4-TOPO), 
using the following oligonucleotide primers: 

15 5 ' - GGTACCA CTTCTCTCAATCCAACTTTC-3 1 

(SEQ ID NO: 93) 



5' -GTCCAA^ ' 
(SEQ ID NO: 94) 

20 The first primer (SEQ ID NO: 93) is designed to introduce a Kpn I site at the 5' 

end of the PCR fragment (solid underline). The two PCR fragments have 20 nt of 
complementary sequence (dotted underlines). The two PCR fragments are mixed together, 
and a fragment of 626 bp is amplified using SEQ ID NO: 93 and SEQ ID NO: 92. The 
resulting PCR fragment is cloned into the vector PCRScript (Stratagene), and sequenced 

25 before cloning between Kpn I and Sac I sites in the vector pMSP-coICAM, resulting in 

plasmid pMSP-ATR-IgA2. This results in a genetic fusion of the extracellular domain of 
ATR and the constant region of human IgA2. This human IgA2 constant region has been 
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synthesized to use codons optimal for expression in tobacco cells. The full nucleotide and 
amino acid sequence of the ATR-IgA2 fusion (the immunoadhesin) is shown in Figure 10. 
In the resulting construct, expression of the chimeric ATR-IgA2 molecule is under the 
control of the constitutive promoter "superMAS" (Ni et al., 1995) and the ags 3' 
5 terminator region. 

The entire expression cassette (promoter + ATR-IgA2 + terminator) is removed 
from pMSP-ATR-IgA2 with the restriction enzyme Asc I, and cloned into the binary 
Agrobacterium Ti plasmid vector pGPTV-kan-ocs, resulting in plasmid pGPTV-kan-ocs- 
ATR-IgA2. The vector pGPTV-kan-ocs is derived from pGPTV-kan (Becker et al., 1992), 

10 which was modified in the following manner. The sequence between the Eco RI and Hind 
III sites of pGPTV-kan, including the entire uid A gene, was removed and replaced with 
the ocs 3' terminator region (MacDonald et al., 1991) oriented toward the npt II gene, plus 
the restriction sites for Asc I and Sac I. The purpose of this terminator adjacent to the right 
border of the T-DNA is to eliminate transcriptional interference with the transgene due to 

15 transcription originating in the plant DNA outside of the right border (Ingelbrecht et al., 
1991). 

Sequence between the T-DNA borders of the plasmid pGPTV-kan-ocs-ATR-IgA2 
is shown in Figure 1 1 . Sequence outside the left and right borders are as described 
(Becker et al., 1992). Nucleotides 18-187 represent the right T-DNA border. Nucleotides 

20 31 1-630 represent the ocs 3' terminator region. Nucleotides 927-1976 represent the 

superMAS promoter. Nucleotides 1990-2017 represent a 5' untranslated region from the 
Nicotiana sylvestris psaDb gene (Yamamoto et al., 1995). The context around the 
initiation ATG (nucleotides 2012-2026) was designed to match that found in highly 
expressed plant genes (Sawant et al., 1999). Nucleotides 2018-2086 comprise a sequence 

25 encoding a modified version of the signal peptide of Vicia faba legumin (Baumlein et al., 
1986). Nucleotides 2087-2605 comprise a sequence encoding the von Willebrand factor 
type A domain of ATR (Bradley et al., 2001). Nucleotides 2606-3631 comprise a 
sequence encoding the human IgA2m(2) constant region (Chintalacharuvu et al., 1994). 
Nucleotides 3794-4108 derive from the agropine synthase (ags) terminator. Nucleotides 

30 4530-4800 represent the NOS promoter (Depicker et al., 1982). Nucleotides 4835-5626 
encode the npt II gene (conferring resistance to kanamycin). Nucleotides 5648-5870 are 
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the polyadenylation signal from A. tumefactions gene 7 (Gielen et al., 1984). Nucleotides 
6454-6602 represent the left T-DNA border. 

A construct for the expression in plants of human J chain and secretory component 
has also been developed. This construct, pGPTV-hpt-ocs-35SJ/SC, is based on the vector 
5 pGPTV-hpt-ocs, derived from pGPTV-hpt in the same manner as described for pGPTV- 
kan-ocs above. Sequence between the T-DNA borders of the plasmid pGPTV-hpt-ocs- 
35SJ/SC is shown in Figure 12. Sequence outside the left and right borders are as 
described (Becker et al., 1992). Nucleotides 1-149 represent the left T-DNA border. 
Nucleotides 733-955 (complement) represent the polyadenylation signal from A 

10 tumefactions gene 7 (Gielen et al., 1984). Nucleotides 980-2002 (complement) represent 
the hpt gene (conferring resistance to hygromycin). Nucleotides 2049-2318 (complement) 
represent the NOS promoter (Depicker et al., 1982). Nucleotides 2898-3230 represent the 
cauliflower mosaic virus (CaMV) 35S promoter driving expression of the human secretory 
component gene including it's native signal peptide (nucleotides 3236-5056), and 

15 nucleotides 5060-5445 represent the polyadenylation signal from the pea rbcS-E9 gene 
(Mogen et al., 1992). Nucleotides 5457-5788 represent a second copy of the CaMV 35S 
promoter driving expression of the human Joining (J) chain gene including it's native 
signal peptide (nucleotides 5797-6273), and nucleotides 6281-6494 represent the gene 7 
terminator. Nucleotides 6501-6819 (complement) represent the ocs 3' terminator region. 

20 Nucleotides 6944-7 1 1 3 represent the right T-DNA border. 

11. Plant Transformation and ATR Immunoadhesin Expression in Plants 

The expression cassettes described above are used to produce the assembled 
immunoadhesin in plants, via Agrobacterium-mediated transformation. Plasmids pGPTV- 
hpt-ocs-35SJ/SC and pGPTV-kan-ocs-ATR-IgA2 are introduced separately into A 
25 tumefaciens strain LBA4404. Overnight cultures of both strains are used for simultaneous 
"co-cultivation" with leaf pieces of tobacco, according to a standard protocol (Horsch et 
al., 1985). Transformed plant tissue is selected on regeneration medium containing both 
kanamycin (100 jig/mL) and hygromycin (25 ^ig/mL). 

Plantlets that regenerate in the presence of antibiotic are screened for transgene 
30 expression. This is accomplished by preparing extracts of leaf tissue in phosphate 

buffered saline (PBS) and spotting clarified extracts on nitrocellulose paper. These "dot" 
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blots are probed with alkaline-phosphatase-conjugated antisera specific for human IgA, J 
chain or secretory component. Plants that test positive on this first screen are subjected for 
further screens involving western blotting and PCR. The ATR-IgA2 immunoadhesin is 
expected to have a subunit MW of 59 kDa. Due to natural dimerization of the heavy chain 
5 constant region, dimers of ~1 18 kDa are also expected to form. These dimers further 

dimerize within the plant cell in the presence of J chain, forming a molecule of ~252 kDa. 
With the addition of secretory component, a molecular species of -320 kDa is observed. 

The presence of a signal peptide in the chimeric heavy chain, J chain and secretory 
component constructs is important for assembly into a multimeric immunoadhesin. Upon 
10 translation of the mRNAs, signal peptide cleavage is predicted to deposit the each protein 
into the plant cell's endoplasmic reticulum (ER). Assembly into a multimeric 
immunoadhesin is expected to take place in the ER and golgi bodies, and the assembled 
molecule is then secreted from the cell. 

12. Purification of Assembled ATR Immunoadhesin 

1 5 Purification of Assembled ATR Immunoadhesin can be accomplished essentially 

as described for the ICAM-1 mmunoadhesin of Example 3, supra. 

13. The ATR Immunoadhesin Inhibits Toxin Action on Mammalian Cells 

The expression cassettes described above are used to produce the assembled 
immunoadhesin, which is purified from plant extracts. The purified immunoadhesin is 

20 used to protect CHO-K1 cells from being killed in a simple bioassay. CHO-K1 cells have 
the receptor to which PA binds on their cell surfaces, but they are not sensitive to the 
toxin. They are killed when challenged with PA and LF N -DTA, a fusion protein 
composed of the N-terminal 255 amino acids of LF linked to the catalytic A chain of 
diptheria toxin. This recombinant toxin exploits the same LF-PA-receptor interactions 

25 that are required for the binding and entry of the native LF and OF proteins. To test the 
protective effect of the immunoadhesin, CHO-K1 cells are mixed with an increasing 
amount of ATR-IgA2 in the presence of a constant (toxic) amount of PA and LFn-DTA, 
and the subsequent effect on protein synthesis is measured. ATR-IgA2 is an effective 
inhibitor of toxin action, inhibiting toxin action at a lower molar concentration than 

30 soluble ATR. 
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14. The ATR Immunoadhesin Inhibits Toxin Action in Human Subjects 

The purified immunoadhesin is prepared in a pharmaceutical^ acceptable buffer 
and is administered to human subjects infected with Anthrax. The route of administration 
may be either as an inhaled aerosol or as an injection. Subjects in late stages of infection 
5 who would normally die are protected from toxin action by the immunoadhesin. 

15. Construction of an Alternative ATR Immunoadhesin Expression 
Cassette 

A cassette encoding the entire extracellular portion of human ATR (amino acids 
24-320) is prepared by PCR cloning. Specifically, a fragment of 878 bp is amplified from 
10 plasmid ATR (Bradley et al., 2001), or from plasmid TEM8 (St Croix et al., 2000) using 
the following oligonucleotide primers: 

5 ' -GGGGGACGCAGGG^ ' 
(SEQ ID NO: 95) 

15 5 ' - GAGCTCCCGTCAGAACAGTGTGTGGTGGTG - 3 ' 

(SEQ ID NO: 96) 



The second primer (SEQ ID NO: 96) is designed to introduce a Sac I site at the 3' 
end of the coding region of the ATR extracellular domain (solid underline). PCR is 
20 performed with Pfu polymerase (Stratagene) to reduce accumulation of errors. A second 
fragment of 121 bp, which includes a 5' untranslated region and a plant signal peptide, is 
amplified from plasmid 6ATG-TOPO#4, using the following oligonucleotide primers: 

5 ' - GGTACCA CTTCTCTCAATCCAACTTTC - 3 ' 

(SEQ ID NO: 93) 

25 5 ' -ATCCTCCCTGC - 3 ' 

(SEQ ID NO: 97) 
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The first primer (SEQ ID NO: 93) is designed to introduce a Kpn I site at the 5' 
end of the PCR fragment (solid underline). The two PCR fragments have 20 nt of 
complementary sequence (dotted underlines). The two PCR fragments are mixed together, 
5 and a fragment of 981 bp is amplified using SEQ ID NO: 93 and SEQ ID NO: 96. The 
resulting PCR fragment is cloned into a plant expression cassette to form a genetic fusion 
with human IgA2 in the same manner as the partial ATR extracellular domain (Example 
1). 

An alternate construction using this same method would amplify amino acids 41- 

10 227. 

* * * 



The foregoing examples are not limiting and merely representative of various 
aspects and embodiments of the present invention. All documents cited are indicative of 
15 the levels of skill in the art to which the invention pertains. The disclosure of each 
document is incorporated by reference herein to the same extent as if each had been 
incorporated by reference in its entirety individually, although none of the documents is 
admitted to be prior art. 

One skilled in the art will readily appreciate that the present invention is well 
20 adapted to carry out the objects and obtain the ends and advantages mentioned, as well as 
those inherent therein. The methods and compositions described illustrate preferred 
embodiments, are exemplary, and are not intended as limitations on the scope of the 
invention. Certain modifications and other uses will occur to those skilled in the art, and 
are encompassed within the spirit of the invention, as defined by the scope of the claims. 

25 It will be readily apparent to one skilled in the art that varying substitutions and 

modifications may be made to the invention without departing from the scope and spirit of 
the invention. Thus, such additional embodiments are within the scope of the invention 
and the following claims. 
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The invention illustratively described herein suitably may be practiced in the 
absence of any element or elements, limitation or limitations which is not specifically 
disclosed herein.. The terms and expressions which have been employed are used as terms 
of description and not of limitation, and there is no intention in the use of such terms and 
5 expressions of excluding any equivalents of the features shown and described, or portions 
thereof. It is recognized that various modifications are possible within the scope of the 
invention claimed. Thus, it should be understood that although the present invention has 
been specifically disclosed by preferred embodiments, optional features, modifications and 
variations of the concepts herein disclosed may be resorted to by those skilled in the art, 
10 and that such modifications and variations are considered to be within the scope of this 
invention as defined by the description and the appended claims. 

In addition, where features or aspects of the invention are described in terms of 
Markush groups or other grouping of alternatives, those skilled in the art will recognize 
that the invention is also thereby described in terms of any individual member or subgroup 
15 of members of the Markush group or other group, and exclusions of individual members 
as appropriate. 

Other embodiments are within the following claims. 

We claim: 
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CLAIMS 

1 . An immunoadhesin comprising a chimeric toxin receptor protein, said toxin 
receptor protein comprising: 

a toxin receptor protein linked to at least a portion of an immunoglobulin heavy 
5 chain; and 

J chain and secretory component associated with said chimeric toxin receptor 
protein. 

2. The immunoadhesin of claim 1 wherein said toxin receptor protein is an Anthrax 
toxin receptor protein comprised of 

10 the extracellular domain of Anthrax toxin receptor or any portion thereof. 

3. The immunoadhesin of claim 1 wherein said immunoglobulin heavy chain is 
selected from the group consisting of 

IgA, IgAl, IgA2, IgM, and chimeric immunoglobulin heavy chains. 

4. The immunoadhesin of claim 2 comprising at least one additional chimeric 
1 5 Anthrax toxin receptor protein. 

5. The immunoadhesin of claim 2 wherein said Anthrax toxin receptor protein is 
comprised of any portion of the extracellular domain of Anthrax toxin receptor 
protein; and said immunoglobulin heavy chain comprises at least a portion of an 
IgA2 heavy chain. 

20 6. The immunoadhesin of claim 1 expressed in transgenic plants. 

7. The immunoadhesin of claim 1 expressed in monocotyledonous plants. 

8. The immunoadhesin of claim 1 expressed in dicotyledonous plants. 

9. The immunoadhesin of claim 1 wherein all proteins are human. 

10. The immunoadhesin of claim 1 expressed in heterologous cells derived from plants 
25 vertebrates, or invertebrates. 
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11. The immunoadhesin of claim 1 expressed in mammalian cells. 

12. The immunoadhesin of claim 1 expressed in hairy root cultures 

13. The immunoadhesin of claim 1 expressed in plant cells in tissue culture. 

14. An immunoadhesin comprising a chimeric bacterial or viral toxin receptor protein, 
5 said toxin receptor protein comprising: a toxin receptor protein linked to at least a 

portion of an immunoglobulin heavy chain, wherein said immunoadhesin has 
plant-specific glycosylation. 

15. The immunoadhesin of claim 14 wherein said toxin receptor protein is an Anthrax 
toxin receptor protein. 

10 16. The immunoadhesin of claim 15 wherein said immunoadhesin further comprises a 
J chain and secretory component associated with said chimeric Anthrax toxin 
receptor protein. 

1 7. The immunoadhesin of claim 1 5 wherein said Anthrax toxin receptor protein is 
comprised of the extracellular domain of Anthrax toxin receptor or any portion 

15 thereof. 

18. The immunoadhesin of cliam 14 wherein said immunoglobulin heavy chain is 
selected from the goup of IgA, IgA u IgA 2 , IgGi, IgG 2 , IgG 3 , IgG 4 , IgD, IgE, IgM, 
and a chimeric immunoglobulin heavy chain. 

19. The immunoadhesin of claim 14 or 15 comprising at least one additional chimeric 
20 toxin receptor protein. 

20. The immunoadhesin of claim 14 or 15 wherein said toxin receptor protein is 
comprised of any portion of the extracellular domain of said toxin receptor protein; 
and said immunoglobulin heavy chain comprises at least a portion of an IgA2 
heavy chain. 

25 21 . The immunoadhesin of claim 14 wherein all proteins are human or associated with 
a human host during infection and/or pathagenesis. 
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22. The immunoadhesin of claim 14 expressed in heterologous cells derived from 
plants vertebrates, or invertebrates. 

23. The immunoadhesin of claim 14 expressed in hairy root cultures 

24. The immunoadhesin of claim 14 expressed in plant cells in tissue culture. 
5 25. The immunoadhesin of claim 14 expressed in transgenic plants. 

26. The immunoadhesin of claim 14 expressed in monocotyledonous plants. 

27. The immunoadhesin of claim 14 expressed in dicotyledonous plants. 

28. A composition comprising an immunoadhesin and plant material, wherein said 
immunoadhesin comprises a chimeric toxin receptor protein, said chimeric toxin 

10 receptor protein linked to at least a portion of an immunoglobulin heavy chain. 

29. The composition of claim 27 further comprising a J chain and secretory component 
with said chimeric toxin receptor protein. 

30. A composition of claim 27 wherein said chimeric toxin receptor protein is 
comprised of any portion of the extracellular domain of said toxin receptor protein; 

1 5 and said immunoadhesin has plant-specific glycosylation. 

31. A composition of claim 27 wherein said immunoglobulin heavy chain is selected 
from the group consisting of IgA, IgA u IgA 2 , IgGi, IgG 2 , IgG 3 , IgG 4 , IgD, IgE, 
IgM, and a chimeric immunoglobulin heavy chain. 

32. A composition of claim 27 comprising at least one additional chimeric toxin 
20 receptor protein. 

33. A composition of claim 27 wherein said toxin receptor protein is comprised of any 
portion of the extracellular domain of said toxin receptor protein; and said 
immunoglobulin heavy chain comprises at least a portion of an IgA2 heavy chain. 

34. The composition of any of claims 28-33 wherein said toxin receptor protein is an 
25 Anthrax toxin receptor protein. 
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35. A method for reducing the binding of a viral or bacterial antigen to a host cell, said 
method comprising: contacting said antigen with an immunoadhesin of claim 1,14 
or 27, and wherein said immunoadhesin binds to said antigen and reduces the toxic 
activity thereof. 

36. A method for reducing mortality and morbidity of a viral or bacterial pathogen, 
said method comprising: contacting an antigen of said viral or bacterial pathogen 
with an immunoadhesin of claim 1, 14 or 27, and wherein said immunoadhesin 
binds to said antigen and reduces the toxic activity thereof. 

37. A method for reducing mortality and morbidity due to a bacterial or viral toxin in a 
human subject, said method comprising: administering to said subject an effective 
amount of an immunoadhesin of claim 1, 14 or 27, and wherein said 
immunoadhesin binds to said toxin and reduces the toxic activity thereof 

38. The method of any of claims 35-37 wherein said toxin is an anthrax PA toxin. 

39. A pharmaceutical composition comprising an immunoadhesin of claim 1, 14 or 27 
in a pharmaceutically acceptable buffer. 

40. An expression vector comprising a gene encoding a chimeric toxin receptor protein 
operatively linked to a plant promoter, said chimeric toxin receptor protein linked 
to at least a portion of an immunoglobulin heavy chain. 

41 . The expression vector claim 40 wherein said toxin receptor protein is an anthrax 
toxin receptor protein. 

42. The immunoadhesin, composition, or method of any of claims 1, 14, 19, 20, 28, 
35, 36, 37, 39, or 40 wehrein said chimeric receptor protein comprises ICAM-1. 
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FIGURE 3 

A. RMW C AqC Aq CAqC Aq C Aq 




B. RS J G SG RS J G SG RS J G SG RS J G SG 




HCAM ~a ~SC -J 



Expression of ICAM-1 -SlgA in independently transformed tobacco calli. Immunoblots, of 
non-reducing SDS-polyacrylamide gels, of different calli (C), and aqueous extracts (Aq) derived from 
them, probed for the presence of human ICAM (A). The MW markers are indicated and the 
reference standard (R) was a mixture (-75 ng each) of human ICAM (-75 kD) and human SlgA 
(»250 kD). The solubility of the piantibody assured us that extraction could be easily performed and 
the similarity of signals leads us to believe in the reproducibility of expression. B. Immuno-blots of 
non-reducing SDS-polyacrylamide gels containing various fractions of partially purified piantibody from 
callus Rhi107-1 1. J = juice, G = G-100 fraction, SG =sterile filtered G-100 fraction (used in CPE 
assay) and RS = a mixture of reference standards of human SlgA (75 ng) and human ICAM-1 (75 ng). 
Blots were probed with antibodies against human ICAM (-ICAM). human IgA heavy chain (-a), 
human secretory component (-SC) and human J chain (-J). Secondary, enzyme -conjugated 
antibodies were employed as necessary to label immuno-positive bands with alkaline phosphatase. 
The specificity of immuno-blotting was further verified by a failure to detect immuno-positive bands in 
extracts of non-expresssing calli (not shown). The expected MW for dimerized ICAM-1 -heavy-chain, 
without glycosylation, is 173,318; this form is likely present in the band migrating just below the 250kD 
marker since it is immuno-positive for ICAM-1 and heavy-chain. This band is also immuno-positive for 
SC (total expected MW of -248 kD) but not for J chain which is somewhat unexpected given the 
canonical pathway for SlgA assembly, which involves 2 cell types (mammalian) and requires the 
presence of J chain prior to assembly of SC. A tetrameric ICAM-1 -heavy-chain fusion, containing a 
single molecule of J chain and a single molecule of SC, has an expected MW of -440 kD. prior to 
glycosylation. ■ Several species with. molecular weights well in excess of 200 kD, immuno-positive with 
all four probes, are readily apparent. 
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FIGURE 7A 

I. HUMAN IG ALPHA- 1 CHAIN C REGION - HOMO SAPIENS (HUMAN). 

AMINO ACID SEQUENCE 

>sp|P018 76|ALCl_HUMAN IG ALPHA- 1 CHAIN C REGION - Homo sapiens (Human). 

10 20 30 40 50 60 

I I I I I I 

ASPTSPKVFP LSLCSTQPDG NWIACLVQG FFPQEPLSVT WSESGQGVTA RNFPPSQDAS 

70 80 90 100 110 120 

I I I i I I 

GDLYTTSSQL TLPATQCLAG KSVTCHVKHY TNPSQDVTVP CPVPSTPPTP SPSTPPTPSP 

130 140 150 160 170 180 

l I I I I I 

SCCHPRLSLH R PALED LLLG SEANLTCTLT GLRDASGVTF TWTPSSGKSA VQGPPERDLC 



190 200 210 220 230 240 

I I I I I I 

GCYSVSSVLP GCAEPWNHGK TFTCTAAYPE SKTPLTATLS KSGNTFRPEV HLLPPPSEEL 
250 260 270 280 290 300 

I - I I I I I 

ALNELVTLTC LARGFSPKDV LVR^QGSQE LPREKYLTWA SRQEPSQGTT TFAVTSILRV 
310 320 330 340 350 

1 I I t E 

AAEDWKKGDT F S CM VGHEAL PLAFTQKTID RLAGKPTHVN VSWMAEVDG TCY 

[SEQUENCE ID NO: 16] 

CODING SEQUENCE 

9 -1 

catccccgac cagccccaag gtctccccgc tgagcctctg cagcacccag ccagatggga 60 
acgtggtcat cgcctgcctg gcccagggct tcttccccca ggagccactc agtgtgacct 120 
ggagcgaaag cggacagggc gtgaccgcca gaaacctccc acccagccag gatgcccccg 18 0 

gggacctgta caccacgagc agccagctga ccctgccggc cacacagcgc ctagccggca 240 
agcccgcgac acgccacgtg aagcactaca cgaatcccag ccaggatgeg accgcgccct 300 
gcccagttcc cccaacccca cctaccccat ctccctcaac tccacctacc ccatctccct 360 
catgctgcca cccccgactg tcactgcacc gaccggccct cgaggacccg ctcccaggtc 420 
cagaagcgaa cctcacgtgc acactgaccg gcctgagaga tgcctcaggt gtcaccccca- 480 
cctggacgcc ctcaagtggg aagagcgctg ttcaaggacc acctgagcgt gacccccgcg 540* 
gccgccacag cgtgtccagt gtcctgccgg gctgtgccga gccacggaac catgggaaga 600 
ccttcacttg caccgctgcc taccccgagt ccaagacccc gctaaccgcc accctctcaa 660 
aatccggaaa cacattccgg cccgaggtcc acctgctgcc gccgccgtcg gaggagctgg 720 
ccctgaacga gctggtgacg ctgacgtgcc tggcacgcgg ccccagcccc aaggacgcgc 7B0 
tggttcgctg gctgcagggg tcacaggagc tgccccgcga gaagtacctg acttgggcac 840 
cccggcagga gcccagccag ggcaccacca ccttcgctgt gaccagcaca ctgcgcgtgg 900 
cagccgagga ctggaagaag ggggacacct tcccccgcac ggtgggccac gaggccccgc 960 
cgctggcctt cacacagaag accatcgacc gcttggcggg caaacccacc catgecaatg 1020 
tgtctgttgt cacggcggag gtggacggca cccgctactg a 1061 

[SEQUENCE ID NO: 15] 

GenBank 

J00220 

LOCUS HUMIGCC8- 2 53 3 bp DNA PRI 02 -DEC- 1998 

DEFINITION Homo sapiens immunoglobulin alpha-1 heavy chain constant region 

(IGHA1) gene, partial cds - 
ACCESSION J00220 

VERSION J00220.1 GI: 184743 

KEYWORDS 

SOURCE human . 

ORGANISM Homo sapiens 
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FIGURE 7B 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
COMMENT 



FEATURES 

source 



intron 



gene 



intron 



intron 



Eukaryota; Metazoa,- Chordata,- Crania ta; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini .- Hominidae ,- Homo. 
1 (bases 1 to 2533) 

Takahashi.N. , Ueda,s., Obata.M., Nikaido. T. , . Nakai , S . and Hon jo, T. 
Structure of human immunoglobulin gamma genes: Implications for 
evolution of a gene family 
Cell 29, 671-679 (1982) 
83001943 

This sequence is part of a multigene region containing the 
immunoglobulin heavy chain gamma- 3, gamma- 1, pseudo-epsilon, and 
alpha- 1 genes. 

Location/Qualifiers 

1..2S33 

/organism- -Homo sapiens" 
/db_xref - " taxon : 9ff06 " 
/chromosome= M l4 " 
/map- , *l4q32 .33" 

/clones "cosmid Igi3 ; Ig-gamma3 -122 " 
/tissue_type- "placenta; liver" 
/germline 
<1 . . 141 

/note»"alpha-l intron J-C" 
142. .447 
/genea-IGHAl" 
<142 . .>1638 
/gene- ,, IGHAl u 
. join {< 142. .447,662. .1021,124 4 . .163 8) 
/gene«"lGHAl" 
/codon_starto3 

/product immunoglobulin alpha -l heavy chain constant 
region" 

/protein_id»"AAC82S2 8 .1" 
/db_xref» w GI; 18474 9" 

/ 1 ran s 1 a t i on » " S PTS P KV FPLS LCS TQPDGNW I ACLVQGF F PQE P LS VTWS ESG 
QGVTARNFPPSQDASGDLYTTSSQLTLPATQCLAGKSVTCHVFCHYTNPSQDVTVPCPV 
PSTPPTPSPSTPPTPSPSCCHPRLSLHRPALEDLLLGSEANLTCTLTGLRDASGVTFT 
WTPSSGKSAVQGPPERDLCGCYSVSSVLPGCAEPWNHGKTFTCTAAYPESKTPLTATL 
S KS GNTFRP EVHLL P P PS EE LALNELVTLTCLARGFS P KDVLVRWLQGS Q E L PRE KYL 
TWAS RQ E P SQGTTTFAVTS I LRVAAED WKKGDTFS CMVGHEALPLAFTQ KT I DRLAGK 
PTHVNVSWMAEVDGTCY " 
448 . .661 
/gene»"IGHAl" 
/note* "A" 
662. .1021 
/gene-"IGHAl» 
1022 . .1243 
/geneo-iGHAl" 
/note»"B" 
1244 . . >163B 
/gene*"IGHAl" 

866 c 753 g 424 t 



490 a 



BASE COUNT 
ORIGIN 

1 ggtccaactg caggcctgtg gtgcaggagc tgtgtgacca tggggctgtc accaggcctc 

61 tctgtgctgg gttcctccag tatagaggag aggcagtata gaggagaggg ccgcgtcctc 

121 acagtgcatt ctgtgttcca gcatccccga ccagccecaa ggtcttcccg ccgagcctct 

181 gcagcaccca gccagatggg aacgtggtca tcgcctgcct ggcccagggc ttcttccccc 

241 aggagccact cagtgtgacc tggagcgaaa gcggacaggg cgtgaccgcc agaaacttcc 

301 cacccagcca ggatgcctcc ggggacctgt acaccacgag cagccagccg accctgccgg 

361 ccacacagcg cctagccggc aagtccgtga catgccacgt gaagcaccac acgaatccca 

421 gccaggacgt gactgtgccc cgcccaggcc agagggcagg ctggggagtg gggcggggcc 

481 accccgtcgt gccctgacac tgcgcctgca cccgcgtccc ccacagggag ccgccccttc 

541 actcacacca gagtggaccc cgggccgagc cccaggaggt ggtggtggac aggccaggag 

601 gggcgaggcg ggggcatggg gaagtatgtg ctgaccagct caggccatct ctccactcca 

661 gttccctcaa ctccacctac cccatctccc tcaactccac ctaccccatc tccctcacgc 

721 tgccaccccc gactgtcact gcaccgaccg gccctcgagg acctgctcct aggttcagaa 

781 gcgaacctca cgtgcacacc gaccggcctg agagatgcct caggcgccac cttcacctgg 
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FIGURE 7C 

gtgggaagag cgctgttcaa ggaccacccg agcgtgacct ctgtggctgc 

ccagtgtcct gccgggctgt gccgagccat ggaaccatgg gaagaccttc 

ctgcctaccc cgagtccaag accccgccaa ccgccaccct ctcaaaatcc 

gaccctgctc ggggccctgc tcagtgctct ggtctgcaaa gcatattcct 

ccccccccca atcctgggct ccagtgccca tgccaagcac acagggaaac 

tgaggggcca ggacacagcc cggggcgccc accagagcag aggggctccc 

cccagccccc tgacctggct ctccaccctc caggaaacac actccggccc 

tgctgccgcc gccgtcggag gagctggccc tgaacgagct ggegacgceg 

cacgcggctt cagccccaag gacgtgccgg ttcgctggct gcaggggtca 

cccgcgagaa gtacctgacc cgggcacccc ggcaggagcc cagccagggc 

tcgctgtgac cagcacactg cgcgtggcag ccgaggaccg gaagaagggg 

cctgcatggt gggccacgag gccctgccgc tggcctccac acagaagacc 

tggcgggtaa acccacccat gtcaatgtgt ctgttgtcat ggcggaggcg 

gctactgagc cgcccgcccg cccccacccc tgaataaact ccatgctccc 

ccacgcttcc atccggcgcc tgtctgtcca tcctcagggt ctcagcaccc 

cagggcatgg acagggaaga acaccccctg ccctgagcct cggggggccc 

catgagactt tccaccccgg tgcgagtgcg agttgtgagt gtgagagtgc 

aggcctcgct ggtgtgagat cttaggcctg ccaaggcagg cacagcccag 

gagagacgca cacgccccgg acagtcctga gcgagcagcg gcatggccgt 

gagagccgcc tctggctgta gccgggaggg aatagggagg gtaaaaggag 

aagaaaggcg caggcagtgg caggagcggc gagggagcga ggggctggac 

cactgggagg acaagctcca ggagggcccc accaccccag tgggtgggcc 

ccactgacgc acgcaggaag gggcacctcc ccttaaccac actgctctgt 

tgggcacagg tgcacactca cactcacata tatgcctgag ccctgcagga 

cacagcccag acccagttcc agaaaagcca ggggagcccc ctcccaagcc 

gcctgctccc ctaggcccct ccggcccccc tgtgtttcca ctgtgcacag 

aaccccacag acccctccca ggcagccccc gctccctgcc tggccaagtc 

tcctaagccc aaccaggacc caaagcatag acagggaggg gccacgcggg 

[SEQUENCE ID NO: 52] 

n. HUMAN IG ALPHA-2 CHAIN C REGION - HOMO SAPIENS (HUMAN). 

AMINO ACID SEQUENCE 



>Sp J P018 77 J ALC2_HUMAN 


IG ALPHA- 2 


CHAIN C REGION - Homo 


sapiens (Hi 


10 
1 


20 


30 


40 


50 


60 


ASPTSPKVFP 


1 

LSLDSTPQDG 


1 

NVWACLVQG 


1 

FFPQEPLSVT 


i 

WSESGQNVTA 


1 

RNFPPSQDAS 


70 
1 


80 
1 


90 


100 


110 


120 


GDLYTTSSQL 


TLPATQCPDG 


i 

KSVTCHVKHY 


1 

TNPSQDVTVP 


1 

CPVPPPPPCC 


1 

HPRLSLHRPA 


130 
1 


140 

1 


150 


160 


170 


180 


LEDLLLGSEA 


NLTCTLTGLR 


1 

DASGATFTWT 


1 

PSSGKSAVQG 


1 

PPERDLCGCY 


1 

SVSSVLPGCA 


190 

! 


200 


210 


220 


230 


240 


QPWNHGETFT 


1 

CTAAHPELKT 


1 

PLTANITKSG 


1 

NTFRPEVHLL 


1 

PPPSEELALN 


I 

ELVTLTCLAR 



841 acgccctcaa 
901 cacagcgcgt 
961 acttgcactg 
1021 ggtgggccca 
1081 ggcctgcccc 
1141 tgaggcaggc 
1201 tcatcccctg 
1261 gaggcccacc 
13 21 acgtgcctgg 
1381 caggagccgc 
1441 accaccacct 
1501 gacaccctct 
1561 atcgaccgct 
1621 gacggcacct 
1681 ccaagcagcc 
1741 gggaaagggc 
1801 ctggcacccc 
1861 gtggtgcagg 
1921 gatgggttct 
1981 ttgtccccga 
2 041 caggctagcc 
2101 cccagggccc 
2161 tcaggacgtc 
2221 acggggcacg 
22 81 gcggaacgtt 
2341 cccaagctca 
2401 accaggcacc 
2461 tcccacccct 
2521 gtggcaccag 
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FIGURE 7D 



250 260 270 280 290 300 

I I t I I | 

GFSPKDVLVR WLQGSQELPR EKYLTWASRQ EPSQGTTTFA VTSILRVAAE DWKKGDTFSf 



310 320 330 340 

I I I I 

MVGHEALPLA FTQKTIDRLA GKPTHWVSV VMAEVDGTCY 



[SEQUENCE ID NOsl8] 



CODING SEQUENCE 



catccccgac 
acgtggtcgt 
ggagcgaaag 
gggacctgta 
agtccgtgac 
gcccagttcc 
tcgaggacct 
atgcctctgg 
cacccgagcg 
agccacggaa 
cactaaccgc 
cgccgccgtc 
gcttcagccc 
agaagtacbt 
tgaccagcat 
tggtgggcca 
gtaaacccac 



cagccccaag 
cgcatgcctg 
cggacagaac 
caccacgagc 
atgccacgtg 
cccacctccc 
gctcttaggt 
tgccaccttc 
tgacctctgt 
ccacggggag 
caacatcaca 
ggaggagccg 
caaggatgtg 
gacttgggca 
actgcgcgtg 
cgaggccctg 
ccacgtcaac 



gtcttcccgc 
gcccagggcc 
gtgaccgcca 
agccagccga 
aagcaccaca 
ccatgctgcc 
ccagaagcga 
acctggacgc 
ggctgctaca 
accttcacct 
aaatccggaa 
gccctgaacg 
ctggttcgct 
tcccggcagg 
gcagccgagg 
ccgctggcct 
gtgtctgttg 



tgagcctcga 
tct tccccca 
gaaact tccc 
ccctgccggc 
cgaatcccag 
acccccgact 
accccacgtg 
cctcaagtgg 
gcgtgtccag 
gcactgctgc 
acacattccg 
agctggtgac 
ggccgcaggg 
agcccagcca 
actggaagaa 
ccacacagaa 
tcatggcgga 



cagcaccccc 
ggagccactc 
acctagccag 
cacacagtgc 
ccaggacgcg 
gtcgctgcac 
cacaccgacc 
gaagagcgcc 
tgtcctgcct 
ccaccccgag 
gcccgaggtc 
gctgacgtgc 
gccacaggag 
gggcaccacc 
gggggacacc 
gaccatcgac 
ggtggacggc 



caagatggga 
agtgtgacct 
gatgcccccg 
ccagacggca 
actgcgccct 
cgaccggccc 
ggcctgagag 
gcccaaggac 
ggctgcgccc 
t cgaagaccc 
cacctgctgc 
ctggcacgtg 
ccgccccgcg 
accctcgctg 
cccccccgca 
cgcc tggcgg 
acctgctact 

C SEQUENCE 



-1 
60 
120 
180 
240 
300 
360 
420 
480 
S40 
600 
660 
720 
780 
840 
900 
960 
1020 
1022 
ID NO: 17] 



GenBank 

J00221 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 
TITLE 



JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 



TITLE 



JOURNAL 
MEDLINE 



Human Ig germline 
HUMIGCD7 2516 bp DNA PRI 11 -APR- 2001 

Human Ig germline H-chain G-E-A region B: alpha-2 A2m(l) allele 
constant region, 3 ' end. 
J00221 

J00221.1 GI:1S4756 

human. 

Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 (bases 1 to 2516) 
Ellison, J. and Hood.L. 

Linkage and sequence homology of two human immunoglobulin gamma 
heavy chain constant region genes 

Proc. Natl. Acad. Sci. U.S.A. 79 (6), 1984-1988 (1982) 

82197621 

6804948 

2 (bases 737 to 1016) 
Flanagan, J. G. and Rabbitcs,T.H. 

Arrangement of human immunoglobulin heavy chain constant region 

genes implies evolutionary duplication of a segment containing 

gamma, epsilon and alpha genes 

Nature 300 (5894), 709-713 (1982) 

83088998 

6817141 

3 (bases 49 CO 229; 425 to 514) 

Hisajima,H.. Nishida,Y., Nakai,s., Takahashi , N . , Ueda/S. and 
Honjo.T. 

Structure of the human immunoglobulin C epsilon 2 gene, a truncated 
pseudogene: implications for its evolutionary origin 
Proc. Natl. Acad. Sci. U.S.A. 80 (10), 2995-2999 (1983) 
83221529 
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FIGURE 7E 

- PUBMED 640700S 
REFERENCE 4 (bases 1 to 2516) 

AUTHORS Flanagan . J . G . . Lefranc.M.P. and Rabbi tts , T . H . 

TITLE Mechanisms of divergence and convergence of Che human 

immunoglobulin alpha 1 and alpha 2 constant region gene sequences 
JOURNAL Cell 36 (3), 661-688 (1984) 
MEDLINE 64130179 
PUBMED 6421489 

COMMENT [3] also reports the complete alpha- 1 gene and part of the A2m(2) 

alpha-2 allele (bases 737-2516; see Sites table) . Comparison of the 
three sequences suggests that the A2m(l) alpha-2 allele might be a 
hybrid of the alpha- 1 gene and A2m(2) alpha-2 allele. The hinge 
region in the alpha genes occurs at beginning of the CH2 domain. 
The alpha- 1 hinge region is 13 amino acids longer than that in 
alpha-2. Both hinge regions consist of approximate tandem repeats 
of a 15 bp sequence. The first repeat occurs 5' to the mRNA splice 
site and is non-coding. The authors [3] suggest that this 
repetitive structure provides a possible mechanism for the large 
number of variations observed in hinge regions. There is a coupled 
30 bp insertion, 9 bp deletion in alpha-2 relative to alpha-1 
(starting at base 97) . 

(1] also reports sequences for the epsilon-1 and epsilon-2 
(pseudogene) C- region genes. The authors [1] determined the 
physical linkage between epsiion-1 and alpha-2 and that between 
epsilon-2 and alpha-1. [2] also reports the alpha-1 CH2 domain and 
epsilon-2. 

This entry is part of a multigene region (region B) , which includes 
the gamma-2, gamma-4, epsilon-1 and alpha-2 genes. See segment 1 
for more comments . 
Complete source information: 

Human genomic DNA, cosmid IglO [1] , [3] ; placenta DNA [2] clone 
H-Ig -alpha-2 5; genomic DNA from TOU II -5 library clone 
lambda- T0U-alpha2 (for A2m(2) allele) [3] . 
FEATURES Location/Qualif iers 

source 1 . .2516 

/organism- "Homo sapiens" 
/db_xre f - - t axon : 9 6 0 6 " 
/map-"14q32 .33" 
/germline 
gene <1..1621 

/gene-"IgH" 
/note-"IGHA2" 
intron <i . . 163 

/gene-" IgH" 

/note»"alpha-2 intron J-C" 
CDS join(<164. .469,684. .1004,1227. .1621) 

/gene«"IgH" 

/note- "contains constant region" 
/codon_start-3 

/product* "immunoglobulin alpha-2 heavy chain" 
/protein_id«"AABS93 96 . 1" 
/db_xref -"GI : 184761" 

/translation- "SPTSPKVFPLSLDSTPQDGNWVACLVQGFFPQEPLSVTWSESG 
ONWARNFPPSQDASGDLYTTSSQLTLPATQCPDGKSVTCHVKHYTNPSQDVTVPCPV 
PPPPPCCHPRLSLHRPALEDLLLGSEANLTCTLTGLRDASGATFTWTPSSGKSAVQGP 
P ERDLCGCYSVS S VL PGCAQ P WNHGET FTCTAAHP E LKT P LTAN I TKS GNTFRPEVHL 
LP P PS EELALNEL VTLTCLARG FS P KDVLVRWLQG5QELP RE KYLTWASRQE PS QGTT 
TFAVTS I LRVAAEDW KKGDTFS CMVGHEALP LAFTQKT I DRLAGKPTHVNVS WMAEV 
DGTCY" 

exon 164 . .46 9 " 

/gene-"IgH" 

/note- "GOO- 119-333" 
intron 470 . . 683 

/gene»"lgH" 

/note- -alpha -2 intron A" 
exon 684.. 1004 
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FIGURE 7F 



variation 



variation 



variation 



variation 



variation 



variation 



variation 



variation 
variation 



variation 



/gene»"IgH n 
100S. .1226 
/gene«"IgH" 

/note="alpha-2 intron B" 
1227. .1621 
/gene-"IgH" 
1434 

/gene="IgH" 

/note-"t in A2m(l> ; a in A2m(2)" 
1441 

/gene-"IgH" 

/note="g in A2m(l) ; a in A2m{2) " 
1465 ■ 
/gene»"IgH" 

/note-^c in A2m(l) ; t in A2m(2)" 
1486 

/gene-"IgH" 

/note--c in A2m(l> ; g in A2m(2)" 
1553 

/gene-"IgH" 

/note= "t in A2m(l); a in A2ro(2)" 
1573 . .1574 
/gene-"IgH° 

/note-"tg in A2m{l) ; ca in A2m(2) 
1602 . . 1606 
/gene» "IgH" 

/note«*'tggac in A2m(l) ,- cggat in A2m{2) " 
2060 

/note-"c in A2m(l) ; t in ' A2m(2) " 
23B4 
/note- 
2390 

/note="c 
a 861 



2) - 



"a in A2m(l); c in A2m(2)" 



in A2m(l) 
c 754 g 



in A2m(2) * 
413 t 



BASE COUNT 488 
ORIGIN 

1 ggtccaaccg caggcccatg gtgcaggagc tgtgtaacct atggggctgt caccaggcct 

61 ctctgtgctg ggttcctcca gtgtagagga gaggcaggta cagcctgtcc tcctggggac 

121 atggcatgag ggccgcgtcc tcacagcgca ttctgtgttc cagcatcccc gaccagcccc 

181 aaggtcttcc cgctgagcct cgacagcacc ccccaagatg ggaacgcggc cgncgcatgc 

241 ctggtccagg gcctcttccc ccaggagcca ctcagtgtga cctggagcga aagcggacag 

301 aacgtgaccg ccagaaactt cccacctagc caggatgcct ccggggacct gtacaccacg 

361 agcagccagc tgaccctgcc ggccacacag tgcccagacg gcaagtccgt gacatgccac 

42l~gtgaagcact acacgaatcc cagccaggat gcgactgtgc cctgcccagg tcagagggca 

481 ggctggggag tggggcgggg ccaccccgtc ctgccctgac actgcgcctg cacccgtgtt 

541 ccccacaggg agccgcccct tcactcacac cagagtggac cccgggccga gccccaggag 

601 gtggtggtgg acaggccagg aggggcgagg cgggggcacg gggaagggcg ttctgaccag 

661 ctcaggccat ctctccactc cagttccccc acctccccca tgctgccacc cccgactgtc 

721 gctgcaccga ccggccctcg aggacctgct cttaggttca gaagcgaacc tcacgtgcac 

781 actgaccggc ctgagagatg cctctggtgc caccttcacc tggacgccct caagtgggaa 

841 gagcgctgtt caaggaccac ctgagcgtga cctctgtggc tgctacagcg tgtccagtgt 

901 cctgcctggc tgtgcccagc catggaacca tggggagacc ttcacctgca ctgctgccca 

961 ccccgagctg aagaccccac taaccgccaa catcacaaaa tccggtgggt ccagaccctg 

1021 ctcggggccc tgctcagtgc tctggtttgc aaagcatatt cccggcctgc ctcctccctc 

lpai ccaatcctgg gctccagtgc tcatgccaag tacacaggga aactgaggca ggctgagggg 

1141 ccaggacaca gcccagggtg cccaccagag cagaggggct ctctcatccc ctgcccagcc 

1201 ccctgacctg gctctctacc ctccaggaaa cacattccgg cccgaggtcc acctgctgcc 

1261 gccgccgtcg gaggagctgg ccctgaacga gctggtgacg ctgacgtgcc tggcacgtgg 

1321 cttcagcccc aaggatgtgc tggttcgctg gctgcagggg tcacaggagc tgccccgcga 

13 81 gaagtacccg acttgggcac cccggcagga gcccagccag ggcaccacca ccttcgctgt 

1441 gaccagcata ctgcgcgtgg cagccgagga ccggaagaag ggggacacct tctcctgcat 

1501 ggtgggccac gaggccctgc cgctggcctt cacacagaag accatcgacc gcttggcggg 

1561 taaacccacc catgtcaatg cgtctgttgt catiggcggag gtggacggca cctgctactg 

1621 agccgcccgc ctgtccccac ccctgaataa actccatgct cccccaagca gccccacgct 

1681 tccatccggc gcctgtctgt ccatcctcag ggcctcagca cttgggaaag ggccagggca 

1741- tggacaggga agaatacccc ctgccctgag cctcgggggg cccctggcac ccccacgaga 

1801 ccttccaccc tggtgtgagt gtgagttgtg agtgtgagag tgtgtggtgc aggaggcccc 
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1861 gctggcgcga gatcttaggt jrtgccaaggc aggcacagcc caggatgggc tccgagagac 

1921 gcacacgccc cggacagttc tgagtgagca gtggcatggc cgtttgtccc tgagagagcc 

1981 gcctctggcc gcagccggga gggaataggg agggtaaaag gagcaggcta gccaagaaag 

2041 gcgcaggtag cggcaggagc ggcgagggag tgaggggctg gaccccaggg ccccactggg 

2101 aggacaagct ccaggagggc cccaccaccc tagtgggtgg gcctcaggac gccccaccga 

2161 cgcatgcagg aaggggcacc tccccctaac cacactgctc tgtacggggc acgtgggcac 

2221 acacgcacac tcacactcac acatacgcct gagccccgca ggagcggaac gttcacagcc 

2291 cagacccagt tccagaaaag ccaggggagt ccccccccaa gcccccaagc tcagcccgct 

2341 cccccaggcc cccctggctt ccctgtgttc ccactgcgca cagaccaggc accaactcca 

2401 cagacccctc ccaggcagcc cctgccccct gcctggccaa gtcccccacc ccttcctaag 

2461 cccaactagg acccaaagca tagacaggga ggggccgcgt ggggtggcat cagaag 

[SEQUENCE ID NO: 53] 



ID. HUMAN IG GAMMA-1 CHAIN C REGION - HOMO SAPIENS (HUMAN) 

AMINO ACID SEQUENCE 

>sp|P01857 |GC1_HUMAN IG GAMMA-1 CHAIN C REGION - Homo sapiens (Human). 

10 20 30 40 50 60 

1 I I I I I 

ASTKGPSVFP LAPSSKSTSG GTAALGCLVK DYFPEPVTVS WNSGALTSGV HTFPAVLQSS 

70 80 90 100 110 120 

I II I I I 

GLYSLSSWT VPSSSLGTQT YICNVNHKPS NTKVDKKVEP KSCDKTHTCP PCPAPELLGG 

130 140 150 160 170 180 

I I I I I I 

PSVFLFPPKP KDTLMISRTP EVTCVWDVS HEDPEVKFNW YVDGVEVHNA KTKPREEQYN 



190 200 210 220 230 240 

I I I I I I 

STYWVSVLT VLHQDWLNGK EYKCKVSNKA LPAPIEKTIS KAKGQPREPQ VYTLPPSRDE 

250 260 270 280 290 300 

I I I I I I . 

LTKNQVSLTC LVKGFYPSDI AVEWESNGQP ENNYKTTPPV LDSDGSFFLY SKLTVDKSRW 

310 320 330 

III 
QQGNVFSCSV MHEAIjHNHYT QKSLSLSPGK 

[SEQUENCE ID NO: 2 0] 



CODING SEQUENCE 

g -1 

cctccaccaa gggcccatcg gccctccccc tggcaccctc ctccaagagc acccccgggg 6 0 

gcacagcggc cctgggccgc ctggtcaagg actacttccc cgaaccggtg acggtgtcgt 120 

ggaacccagg cgccctgacc agcggcgtgc acaccttccc ggetgtccta cagtcctcag 180 

gactccactc cctcagcagc gcggcgaccg tgccctccag cagcttgggc acccagaccc 24 0 

acatctgcaa cgtgaatcac aagcccagca acaccaaggt ggacaagaaa gctgagccca 3 00 

aatcttgtga caaaactcac acacgcccac cgtgcccagc acccgaactc ctggggggac 360 

cgtcagectt ccccttcccc ccaaaaccca aggacacccc catgatcccc cggacccctg 420 

aggtcacatg cgtggtggtg gacgtgagcc acgaagaccc tgaggtcaag ttcaactggt 4 80 

acgcggacgg cgcggaggtg cataatgcca agacaaagcc gcgggaggag cagtacaaca 540 

gcacgcaccg ggcggccagc gccctcaccg tcctgcacca ggactggctg aacggcaagg 600 

agcacaagcg caaggcctcc aacaaagccc ccccagcccc caccgagaaa accatctcca 660 

aagccaaagg gcagccccga gaaccacagg cgtacacccc gcccccatcc cgggacgagc 720 

cgaccaagaa ccaggtcagc ctgacccgcc tggtcaaagg cttctatccc agcgacatcg 7 80 

ccgtggagtg ggagagcaac gggcagccgg agaacaacta caagaccacg ccccccgcgc 84 0 

cggactccga cggctccttc tccccccaca gcaagcccac cgcggacaag agcaggtggc 900 

agcaggggaa cgtcttccca cgctccgtga tgcacgaggc cccgcacaac caccacacgc 96 0 

agaagagccc ccccccgtct ccgggcaaac ga 992 
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[SEQUENCE ID NO: 19] 



GenBank 

J00228 . 
LOCUS 

DEFINITION 

ACCESSION 
VERSION - 
KEYWORDS 
SOURCE 

ORGANISM 



HUMIGCC4 2009 bp DNA PRI 02-DEC-1998 

Homo sapiens immunoglobulin gamma -1 heavy chain constant region 
(IGHG1) gene, partial cds - 
J00228 

J00228.1 GI:184739 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
COMMENT 



FEATURES 

source 



gene 

intron 

CDS 



human . 

Homo sapiens 

Eukaryota; Meta'zoa,- Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 {bases 1 to 2009) 

Takahashi,N. , Ueda.S., Obata.M., Nikaido,T., Nakai.S. and Honjo.T. 
Structure of human immunoglobulin gamma genes: Implications for 
evolution of a gene family 
Cell 29, 671-679 (1982) 

85001943 • • 

This sequence is part of a multigene region containing the 
immunoglobulin heavy chain gamma-3, gamma- 1, pseudo-epsilon, and 
alpha-1 genes: 

Location/Qualifiers 
1..2009 

/organisms "Homo sapiens" - 
/db_xref -"taxon: 9606" 
/ chromos ome - " 1 4 M 
/map«"14q32 .33 " 

/ clone- "cosmid Igl3 ; Ig-gamma3 -122" 
/tissue_type» "placenta; liver" 
/germline 
<1 . . >1803 
/gene="IGHGl" 
<1. .209 
/gene-"IGHGl" 

join{<210. .503, 892. .936,105 5. .1384, 1481. .1803) 
/gene»»IGHGl" 
/codon_start«3 

/product- "immunoglobulin gamma -1 heavy chain constant 
region* 

/protein_ido"AAC82527 . l" 

/db_xref-"GI : 184747" 
• / translat ion= "STKGPSVFPLAPSSKSTSGGTAALGCLVKDYFPEPVTVSWNSGA 

LTSGVHT F P AVLQS SGL YS LSS WTVPS S S LGTQTY I CNVNHKP SNTKVDKKVE P KSC 

DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWY 

VDGVEVHNAKTKPRE EQ YNS TYR WSVLTVLHQD WLNGKE YKCKVSNKALP AP IEKTI 

S KAKGQP RE PQVYTL P P S RDELTKNQVS LTCLVKG FY PSDI AVEWESNGQ P ENNYKTT 

P PVLDSDGS FFLYS KLTVDKSRWQQGNVFSCS VMHEALHNHYTQKSLSLS PGK" 
misc_dif f erence 563 

/gene-"IGHGl" 

/replace-" " 
misc_diff erence 593 

/gene="IGHGl" 

/replace""" 
misc_diff erence 614 

/gene- u IGHGl" 

/replace- " - 
misc_diff erence 633 

/gene- w IGHGl» 

/replace*"" 
misc_diff erence 643 

/gene="IGHGi" 

/replace* »" 
misc_diff erence 654 

/gene»"IGHGl" 
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/replace-" " 
misc_dif f erence 684 

/gene»"IGHGl" 

/replace- ■ " 
misc_dif f erence €92 

/gene»"IGHGl" 

/replace- u ■ 
misc^diff erence 765.. 756 

/gene° M IGHGl" 

/replace-"" 
misc_diff erence 147S 

/gene-"IGHGl" 

/replace-" ■ 
misc_diff erence 1578 

/gene-"IGHGl" 

/replace-"" 

BASE COUNT 418 a 698 c 566 g 327 t 

ORIGIN 

1 agctttctgg ggcaggccag gcctgacctt ggctttgggg cagggagggg gccaaggtga 

61 ggcaggcggc gccagcaggt gcacacccaa tgcccatgag cccagacacc ggacgctgaa 

121 ccccgcggac agttaagaac ccaggggcct ctgcgcctgg gcccagccct gtcccacacc 

181 gcggtcacat ggcaccacct ctcttgcagc ccccaccaag ggcccatcgg tcttccccct 

241 ggcaccctcc tccaagagca cccctggggg cacagcggcc ctgggctgcc tggtcaagga 

301 ccacttcccc gaaccggtga cggtgtcgtg gaactcaggc gccctgacca gcggcgtgca 

361 caccttcccg gctgtcccac agtcctcagg accctacccc ctcagcagcg tggtgaccgc 

421 gccctccagc agcttgggca cccagaccta cacccgcaac gtgaatcaca agcccagcaa 

481 caccaaggtg gacaagaaag ttggtgagag gccagcacag ggagggaggg tgtctgctgg 

541 aagcaggccc agcgctcctg cctggacgca tcccggctat gcagccccag cccagggcag 

601 caaggcaggc cccgtctgcc tcttcacccg gagcctctgc ccgccccact catgctcagg 

661 gagagggtct tctggcttt t tcccaggctc tgggcaggca caggctaggt gcccctaacc 

721 caggccctgc acacaaaggg gcaggtgctg ggcccagacc cgccaagagc catatccggg 

781 aggaccctgc ccccgaccca agcccacccc aaaggccaaa ctccccactc cctcagctcg 

841 gacacccccc ctcctcccag attccagcaa ctcccaacct tctctctgca gagcccaaat 

901 cttgtgacaa aactcacaca tgcccaccgc gcccaggtaa gccagcccag gcctcgccct 

961 ccagcccaag gcgggacagg tgccccagag tagcctgcac ccagggacag gccccagccg 

1021 ggtgctgaca cgcccacccc catctctccc tcagcacctg aactcctggg gggaccgtca 

1081 gtcttcctct cccccccaaa acccaaggac accctcatga tctcccggac ccccgaggcc 

1141 acatgcgtgg tggtggacgt gagccacgaa gaccctgagg ccaagttcaa ccggtacgtg 

1201 gacggcgcgg aggcgcacaa tgccaagaca aagccgcggg aggagcagta caacagcacg 

1261 taccgggtgg tcagcgtcct caccgtcctg caccaggact ggccgaatgg caaggagcac 

1321 aagtgcaagg tctccaacaa agccctccca gcccccatcg agaaaaccat ccccaaagcc 

1381 aaaggtggga cccgtggggt gcgagggcca catggacaga ggccggctcg gcccaccccc 

1441 tgccccgaga gtgaccgctg taccaacctc tgtcctacag ggcagccccg agaaccacag 

1501 gtgcacaccc tgcccccacc ccgggatgag ctgaccaaga accaggtcag cctgacccgc. 

1561 ctggtcaaag gcccctatcc cagcgacatc gccgtggagt gggagagcaa tgggcagccg 

1621 gagaacaact acaagaccac gcctcccgtg ctggactccg acggctcctt ccccccctac 

1681 agcaagccca ccgcggacaa gagcaggtgg cagcagggga acgtcttccc acgctccgcg 

1741 atgcatgagg ccctgcacaa ccactacacg cagaagagcc tctccctgtc tccgggtaaa 

1801 cgagtgcgac ggccggcaag ccccgctccc cgggctcccg cggccgcacg aggatgcttg 

1861 gcacgcaccc cctgtacata ctccccgggc gcccagcacg gaaacaaagc acccagcgcc 

1921 gccctgggcc cctgcgagac tgtgatggfct ctttccacgg gccaggccga gtctgaggcc 
1981 cgagtggcat gagggaggca gagcgggtc 

[SEQUENCE ID NO: 54] 

IV. HUMAN IG GAMMA-2 CHAIN C REGION - HOMO SAPIENS (HUMAN). 

AMINO ACID SEQUENCE 

>sp| P01859 |GC2_HUMAN IG GAMMA-2 CHAIN C REGION - Homo sapiens (Human). 

10 20 30 40 50 60 

I I ( II I 

ASTKGPSVFP LAPCSRSTSE STAALGCLVK DYFPEPVTVS WNSGALTSGV HTFPAVLQSS 

70 80 90 100 110 120 
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i i i i i i 

GLYSLSSWT VPSSNFGTQT YTCNVDHKPS NTKVDKTVER KCCVECPPCP APPVAGPSVF 

130 140 150 160 170 180 

I I ! I I I 

LFPPKPKDTL MISRTPEVTC VWDVSHEDP EVQFNWYVDG VEVHNAKTKP REEQFNSTFR 

190 200 210 220 230 240 

I I I I I I 

WSVLTWHQ DWLNGKEVKC KVSNKGLPAP IEKTISKTKG QPREPQVYTL PPSREEMTKN 

250 260 -270 280 290 300 

I I I I I I 

QVSLTCLVKG FYPSDIAVEW ESNGQ PENNY KTTPPMLDSD GS FFLYSKLT VDKSRWQQGN 



310 320 326 

I I ! 

VFSCSVMHEA LHNHYTQKSL SLSPGK 



[SEQUENCE ID NO: 22] 



CODING SEQUENCE 

gcctccacca agggcccatc ggtcttcccc ccggcgccct gctccaggag 
agcacagccg ccctgggctg cctggtcaag gactacttcc ccgaaccggt 
tggaacccag gcgccctgac cagcggcgtg cacaccttcc cagccgtcct 
ggactctact ccctcagcag cgtggtgacc gcgccctcca gcaacttcgg 
tacacctgca acgtagatca caagcccagc aacaccaagg tggacaagac 
aaatgttgtg tcgagtgccc accgtgccca gcaccacctg tggcaggacc 
ctcttccccc caaaacccaa ggacaccctc atgatctccc ggacccctga 
Stggtggcgg acgcgagcca cgaagacccc gaggtccagt tcaactggta 
gtggaggcgc ataacgccaa gacaaagcca cgggaggagc agctcaacag 
gtggtcagcg tccteaccgt tgtgcaccag gactggctga acggcaagga 
aaggtctcca acaaaggcct cccagccccc accgagaaaa ccatctccaa 
cagccccgag aaccacaggt gtacaccctg cccccacccc gggaggagat 
caggccagcc tgacctgcct ggtcaaaggc tcctacccca gcgacatcgc 
gagagcaacg ggcagccgga gaacaactac aagaccacac ctcccatgct 
ggctccttct tcctctacag caagctcacc gtggacaaga gcaggcggca 
gtcttctcat gctccgtgat gcatgaggct ccgcacaacc actacacgca 
tccctgtctc cgggtaaa 



cacccccgag 
gacggtgtcg 
acagtcctca 
cacccagacc 
agttgagcgc 
gtcagtcttc 
ggtcacgtgc 
cgtggacggc 
cacgttccgt 
gtacaagcgc 
aaccaaaggg 
gaccaagaac 
cgtggagtgg 
ggactccgac 
gcaggggaac 
gaagagcctc 

[SEQUENCE 



60 
120 
180 
240 
300 
360 
420 
480 
S40 
600 
660 
720 
780 
840 
900 
960 
976 
ID NOs21] 



Genfiank 

J00230. Human Ig germline 



LOCUS 
DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 
TITLE 



HUMIGCD1 



2 009 bp DNA 



PRI 



11 -APR- 2 001 



Human Ig germline H-chain G-E-A region B: gamma-2 conscant region. 
3 ' end . 
J00230 V00554 
J00230.1 GI:1847S0 



Chorda ca; Craniata; Vertebrata; Euteleostomi ; 
Primates; Catarrhini; Hominidae; Homo. 



human. 

Homo sapiens 
Eukaryota; Metazoa; 
Mammalia; Eutheria; 

1 (bases 1 to 2009) 
Ellison, J. and Hood,L. 

Linkage and sequence homology of two human immunoglobulin gamma 
heavy chain constant region genes 

PrOC. Natl. Acad. .Sci. U.S.A. 79 (S) , 1984-1989 (1982) 

82197621 

6804948 

2 (bases 896 to 1256; 1749 to 1937) 
Krawinkel,U. and Rabbi t ts , T .H . 

Comparison of Che hinge-coding segments in human immunoglobulin 
gamma heavy chain genes and the linkage of the gamma 2 and gamma 4 
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Subclass genes JOURNAL EMBO J. 1 (4), 403-407 (1982) 

MEDLINE 8423S992 PUBMED 6329676 
REFERENCE 3 (bases 4?S to 1071; 1179 CO 1330; 1461 to 1S24) 

AUTHORS Takahashi,N. , Ueda,S., Obata,M., Nikaido,T., Nakai,S. andHonjo.T. 

TITLE Structure of human immunoglobulin gamma genes: implications for 

evolution of a gene family JOURNAL Cell 29 (2) , 671-679 (1982) 

MEDLINE 83001943 PUBMED 6811139' 
COMMENT On Mar 2, 2000 this sequence version replaced gi:327S9. 

[2 J also reports sequences for gamma- 3, gamma -4, and a gamma 
pseudogene. Most of this sequence is 95% homologous with gamma-4. 
The hinge exohs are only 70% homologous. The authors estimate that 
gamma-2 and gamma-4 diverged 6.6 million years ago. The authors in 
[i] speculate that intron-mediated domain transfer played an 
important role in the evolution of human gamma genes. They also 
report the hinge regions of gamma- 1, gamma- 3, gamma-4, and a 
pseudo-gamma gene. [1} estimates the divergence • of the human gamma 
genes to be between 7.7 and 4.4 million years ago. This entry is 
part of a multigene region containing the gamma-2, gamma-4, 
epsilon-1, and alpha-2 genes. The relative locations of the four 
genes were determined by Flanagan and Rabbi tts (Nature 300, 709-713 
(1982)) . They refer to this gene group as region B. The region A 
genes are gamma-3, gamma-l, pseudo-epsilon, alpha-l. Flanagan and 
Rabbits also determined the general locations of the two regions. 
They place region A between the JH/mu/delta region and region B. 
Complete source information: 

Human fetal liver DNA, library of T. Maniatis [3] and Lawn et al 
121 , [1] ; clones p-gamma-2RPA3 [2] , 5A [33 . and Ig-gamma-2 - IS (1} . 
FEATURES Location/Qualifiers source 1..2009 

/organismo "Homo sapiens" 



/map: 
<1 . .215 
<1. .2009 
/note-"IGHG2" 
/gene-"IgH" 
C_region 216.. 508 



intron 
conflict 



conflict 



conflict 



conflict 
conflict 



C_ region 



/db_xref-"t axon: 9606" /map-"14q3 2 . 3 3 " 

/germline intron 
/gene-"IgH" gene 
./gene«"IgH" 
216 . .509 

/note- "GOO- 119- 33 8" 
/gene-"IgH" 

/notes "immunoglobulin heavy chain constant region CHI" 
join(<216. .509,902. .937, 1056 . .1382,1480. .1802) 
/gene="IgH" /codon_start»3 
/product- "immunoglobulin gamma-2 heavy chain" 
/protein_id-"AABS93 93 .1" ■ 
/db_xref-"GI : 184758" 

/ translation- "STKGPSVFP1APCSRSTSESTAALGCLVKDYFPEPVTVSWNSGA 

LTSGVHTF PAvTjQSSGLY S LSS VVTVPS SNFGTQTYTCNVDHKPSNTKVTJKTVE RKCC 

VECP PCPAP P VAGPS VFLFP P KP KDTLM I SRTP EVTCVWDVSHED PEVQ FNWYVDGV 

EVHNAKTKPREEQFNSTFRWSVLTVVHQDWLNGKEYKCKVSNKGLPAP I EKTI S KTK 

GQPREPQVTTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPML 

DSDGS FFLYS KLTVDKSRWQQGNVFSCS VMHEALHNHYTQKSLSLSPGK " 

510.. 901 /gene-"IgH" 

537 /gene-"IgH" 

/citation- [3] /replace-"" 

550.. 551 /gene=-IgH" 

/citation- £3] /replace-"" 

570 /gene-"IgH" 

/citation- [3] /replace-"" 

777.. 778 /gene="lgH" 

/citation- [3] /replace*"" 

791 /gene«"IgH" 

/citation- [3 J /replace-"" 

864 /gene="IgH" 

/citation- [3) /replace-"- 

901.. 936 /gene» M IgH" 

/note- "immunoglobulin heavy chain hinge" 

902.. 937 /gene-"IgH" 

/note="G00-119-33S« intron 938.. 1055 

/gene»"IgH" C_region 10S5..1381 
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FIGURE 7L 



ORIGIN 

1 
61 
121 
181 
241 
301 
361 
421 
481 
541 
601 
661 
721 
781 
841 
901 
961 
1021 
1081 
1141 
1201 
1261 
1321 
1381 
1441 
1501 
1561 
1621 
1681 
1741 
1801 
1861 
1921 
1981 



/gene«"IgH" 

/note- " immunoglobul in heavy chain constant region CH2 " 
1056.. 1382 /gene="IgH" 
/note="G00-H9-338" incron 1383.. 1479 

/gene="IgH" C_region 1479.. 1799 

/gene="IgH" 

/note- "immunoglobulin heavy chain constant region CH3 " 
1480.. 1802 /gene="IgH" 
/note«"GOO-119-338" conflict 1493 

/gene»"IgH* 
/replace-" " 
/gene- "IgH" 
/replace-" ■ 
/gene«"IgH" 
/replace-" " 
/gene= "IgH" 
/replace-" " 
/ gene- " IgH" 
/replace-" " 
/gene="IgH" 
/replace- " " 
/gene="lgH" 
/gene-"IgH" 
/replace*" " 
/gene- "IgH" 
/ replace- " "BASE COUNT 





/citation- 


[3] 


conflict 


1802 . .1806 






/citation- 


[23 


conflict 


1814 . .1815 






/citation- 


(2) 


conflict 


1825 






/citation- 


E2] 


conflict 


1844 . .1853 






/citation- 


[2] 


conflict 


1890 






/citation- 


[2] 


polyA_signal 


1903 . . 1908 




conflict 


1909. .1918 






/citation- 


[2] 


conflict 


1929 . . 1932 






/citation- 


[2] 



agctttctgg 
ggcaggtggc 
ctggaccctc 
cacaccgcgg 
ccccctggcg 
caaggactac 
cgtgcacacc 
gaccgtgccc 
cagcaacacc 
tgctggaagc 
agggcagcaa 
tgctcaggga 
gcccctaccc 
catatccggg 
cctcagctcg 
gagcgcaaat 
tccagctcaa 
gggtgctgac 
cttcctcttc 
gtgcgtggtg 
cggcgtggag 
ccgtgtggtc 
gtgcaaggtc 
aggtgggacc 
ccctgggagt 
tgtacaccct 
tggtcaaagg 
agaacaacta 
gcaagctcac 
tgcatgaggc 
gagtgccacg 
gcacgcaccc 
gccctgggcc 
tgagtggcat 



ggcgagccgg 
gccagccagg 
gtggatagac 
tcacatggca 
ccctgctcca 
ttccccgaac 
ttcccagccg 
tccagcaact 
aaggtggaca 
caggctcagc 
ggcaggcccc 
gagggtcttc 
caggcccttc 
aggaccctgc 
gacaccttct 
gttgtgtcga 
ggcgggacag 
acgtccacct 
cccccaaaac 
gtggacgtga 
gtgcataatg 
agcgtcctca 
tccaacaaag 
cgcggggtat 
gaccgctgtg 
gcccccatcc 
cttctacccc 
caagaccaca 
cgtggacaag 
tctgcacaac 
gccggcaagc 
cgcgtacata 
cctgcgagac 
gagggaggca 



gcctgacttt 
tgcacaccca 
aagaaccgag 
ccacctctcc 
ggagcacctc 
cggtgacggt 
tcctacagtc 
tcggcaccca 
agacagttgg 
cctcctgcct 
atctgtctcc 
tggctttttc 
acacacaggg 
ccctgaccta 
ctcctcccag 
gtgcccaccg 
gtgccctaga 
ccatctcttc 
ccaaggacac 
gccacgaaga 
ccaagacaaa 
ccgttgtgca 
gcctcccagc 
gagggccaca 
ccaacctccg 
cgggaggaga 
agcgacatcg 
cctcccatgc 
agcaggtggc 
cactacacgc 
ccccgctccc 
cttcccaggc 
tgtgatggtt 
gagtgggtc 



410 a 

ggctttgggg 
atgcccgtga 
gggcctctgc 
tgcagcctcc 
cgagagcaca 
gtcgtggaac 
ctcaggactc 
gacctacacc 
tgagaggcca 
ggacgcaccc 
tcacccggag 
caccaggctc 
gcaggtgctt 
agccgacccc 
atccgagtaa 
tgcccaggta 
gtagcctgca 
ctcagcacca 
cctcacgatc 
ccccgaggtc 
gccacgggag 
ccaggactgg 
ccccatcgag 
cggacagagg 
tccctacagg 
tgaccaagaa 
ccgtggagtg 
tggactccga 
agcaggggaa 
agaagagcct 
caggctctcg 
acccagcatg 
ctttccgtgg 



700 c 

cagggagtgg 
gcccagacac 
gcctgggccc 
accaagggcc 
gccgccctgg 
tcaggcgctc 
tactcccrca 
cgcaacgtag 
gctcagggag 
cggctgtgca 
gcctctgccc 
caggcaggca 
ggctcagacc 
aaaggccaaa 
ctcccaatct 
agccagccca 
tccagggaca 
cctgtggcag 
tcccggaccc 
cagttcaact 
gagcagttca 
ctgaacggca 
aaaaccatct 
ccggctcggc 
gcagccccga 
ccaggtcagc 
ggagagcaat 
cggctccttc 
cgtcttctca 
ctccctgtcc 
gggtcgcgtg 
gaaataaagc 
gtcaggccga 



568 g 3 

gctaaggtga 
tggaccctgc 
agctctgtcc 
catcggtctt 
gccgcctggt 
tgaccagcgg 
gcagcgtggt 
atcacaagcc 
ggagggtgtc 
gccccagccc 
gccccactca 
caggctgggt 
cgccaaaagc 
ctgtccactc 
tctctctgca 
ggcctcgccc 
ggccccagct 
gaccgtcagt 
ctgaggtcac 
ggtacgtgga 
acagcacgtt 
aggagtacaa 
ccaaaaccaa 
ccaccctctg 
gaaccacagg 
ctgacctgcc 
gggcagccgg 
ttcctctaca 
tgctccgtga 

cc ggg ta aat 

aggatgcttg 
acccagcgct 
gtctgaggcc 



{ SEQUENCE ID NO t 55] 
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FIGURE 7M 

V. HUMAN IG GAMMA-3 CHAIN C REGION - HOMO SAPIENS (HUMAN). 

AMINO ACID SEQUENCE 

CAA27268 C gamma 3 [Homo sapiens) 



10 


20 


30 


40 


50 


60 


1 

ASTKGPSVFP 


1 

LAPCSRSTSG 


1 

GTAALGCLVK 


1 

DYFPEPVTVS 


I 

WNS G ALTS G V 


1 

HTFPAVLQSS 


70 


80 


90 


100 


110 


120 


1 

GLYSLSSWT 


1 

VPSSSLGTQT 


1 

YTCNVNHKPS 


1 

NTKVDKRVEL 


1 

KTPLGDTTHT 


1 

CPRCPEPKSC 


130 


140 


150 


160 


170 


180 


i 

DTPPPCPRCP 


1 

EPKSCDTPPP 


1 

CPRCPEPKSC 


1 

DTPPPCPRCP 


1 

APELLGGPSV 


1 

FLFPPKPKDT 


190 


200 


210 


220 


230 


240 

i 


1 

LMISRTPEVT 


1 

CVWDVSHED 


1 

PEVQFKWYVD 


1 

GVEVHNAKTK 


1 

PREEQYNSTF 


RWSVLTVLH 


250 


260 


2.70 


280 


290 


300 
1 


1 

QDWLNGKEYK 


1 

CKVSNKALPA 


1 

PIEKTISKTK 


1 

GQPREPQVYT 


1 

LPPSREEMTK 


NQVSLTCLVK 


310 


320 


330 


340 


350 


360 


. . 1 
GFYPSDIAVE 


1 

WESSGQPENN 


1 

YNTTPPMLDS 


1 

DGSFFLYSKL 


1 

TVDKSRWQGG 


1 

NIFSCSVMHE 



370 377 
I 

ALHNRFTQKS LSLSPGK 



[SEQUENCE ID NO: 24] 
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FIGURE 7N 



CODING SEQUENCE 

GCTTCCACCAAGGGCCCATCGGTCTTCCCCCTGGCGCCCTGCTCCAGGAGCACCTCTGGGGGCACAGCGGCCCTG 
GGCTGCCTGGTCAAGGACTACTTCCCCGAACCGGTGACGGTGTCGTGGAACTCAGGCGCCCTGACCAGCGGCGTG 
CACACCTTCCCGGCTGTCCTACAGTCCTCAGGACTCTACTCCCTCAGCAGCGTGGTGACCGTGCCCTCCAGCAGC 
TTGGGCACCCAGACCTACACCTGCAACGTGAATCACAAGCCCAGCAACACCAAGGTGGACAAGAGAGTTGAGCTC 
AAAACCCCACTTGGTGACACAACTCACACATGCCCACGGTGCCCAGAGCCCAAATCTTGTGACACACCTCCCCCG 
TGCCCACGGTGCCCAGAGCCCAAATCTTGTGACACACCTCCCCCATGCCCACGGTGCCCAGAGCCCAAATCTTGT 
GACACACCTCCCCCGTGCCCAAGGTGCCCAGCACCTGAACTCCTGGGAGGACCGTCAGTCTTCCTCTTCCCCCCA 
AAACCCAAGGATACCCTTATGATTTCCCGGACCCCTGAGGTCACGTGCGTGGTGGTGGACGTGAGCCACGAAGAC 
CCCGAGGTCCAGTTCAAGTGGTACGTGGACGGCGTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAG 
TACAACAGCACGTTCCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAACGGCAAGGAGTACAAG 
TGCAAGGTCTCCAACAAAGGCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAAACCAAAGGACAGCCCCGAGAA 
CCACAGGTGTACACCCTGCCCCCATCCCGGGAGGAGATGACCAAGAACCAGGTCAGCCTGACCTGCCTGGTCAAA 
GGCTTCTACCCCAGCGACATCGCCGTGGAGTGGGAGAGCAGCGGGCAGCCGGAGAACAACTACAACACCACGCCT 
CCCATGCTGGACTCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAGCAGGGG 
AACATCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAACCGCTTCACGCAGAAGAGCCTCTCCCTGTCTCCG 
GGTAAATGA 

[SEQUENCE ID NO:23] 

GenBank 

X03604 Human C gamma 3 gene for IgG G3m(b) heavy chain C-region from EZZ 
(individual II -4 of TOU) PubMed, Protein, Related Sequences, Taxonomy, 
OMIM, LinkOut 

LOCUS HSIGGC3 2 5 90 bp DNA PRI 24 -NOV- 1998 

DEFINITION Human C gamma 3 gene for IgG G3m(b) heavy chain C-region from 
EZZ 

(individual II-4 of TOU) . 
ACCESSION X03604 M12958 
VERSION X03604.1 GI: 33070 

KEYWORDS constant region; gamma- immunoglobulin; Ig heavy chain; 

immunoglobulin. 
SOURCE human . 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 

Euteleostomi 

Mammalia; Eutheria; Primates; Catarrhini ; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 2590) 

AUTHORS Huck,S., Fort , P . , Crawford, D.H. . Lefranc,M.P. and Le franc. G. 

TITLE Sequence of a human immunoglobulin gamma 3 heavy chain constant 

region gene: comparison with the other human C gamma genes 

JOURNAL Nucleic Acids Res. 14 (4), 1779-1789 (1986) 

MEDLINE 8614 85 07 
REFERENCE 2 .(bases 4 to 204; 2202 to 2236) 

AUTHORS Takahashi , N . , Ueda,S., Obata.M., Nikaido.T., Nakai.S. and 
Honjo,T. 

TITLE ' Structure of human immunoglobulin gamma genes: implications for 

evolution of a gene family 
JOURNAL Cell 29 (2), 671-679 (1982) 
MEDLINE 83001943 
FEATURES Location/Qualifiers 
source 1 . .2590 

/organism- "Homo sapiens" 
/db_xr'ef «"taxon; 9606" 
conflict 28 

/note="T is missing in [2] • 
/citation^ [2) 
- conflict 80 . . 82 

/note="CGC is GCG in [2]" 



WO 03/064992 



22/91 



PCT/US02/34197 



FIGURE 70 

/citation= [2] 
conflict .120 

/note="C is T in [2] ■ 

/citation^ [2] 
conflict 135 

/note«"T is C in [2] " 

/citation^ [2 J 

CDS join<<215. .509,901. .951, 1095. .1139,1283. .1327, 

1471. . 1515, 1634 . . 1963 , 2061 . . >23 80) 
/ codon_s tart =1 
/product~"C gamma 3" 
/protein_id="CAA2 7 2 68 .1" 
/db_xref="GI : 577056" 

/trans la tion="ASTKGPSVFPLAPCSRSTSGGTAAIiGCLVl<I)YFPEPVTVSWNSG 

ALTSGVHTF PAVLQSSGLYS LS S WTVPS S SLGTQTYTCNVNHKPSNTKVDKRVELKT 



PLGDTTHTCPRCPEPKSCDTPPPCPRCPEPKSCDTPPPCPRCPEPKSCDTPPPCPRCP 
APELLGGP S VFLFP PKPKDTLMI S RTPEVTCVWDVSHEDPEVQFKWYVDGVEVHNAK 
TKPREEQYNSTFRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKTKGQPREP 
QVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESSGQPENNYNTTPPMLDSDGSF 





FLYSKLTVDKSRWQQGNIFSCSVMHEALHNRFTQKSLSLSPGK" 


intron 


510. .900 




/note=" intron I" 


intron 


952 . . 1094 




/note="intron II" 


intron 


1140. .1282 




/note=" intron III" 


intron 


1328 . .1470 




/hote=" intron IV" 


intron 


1516. .1633 




' /note- "intron V" 


intron 


1964 . .2060 




/note*" intron VI" 


misc_f eature 


2484. .2489 




/note="pot. polyA signal" 


COUNT 541 


a 925 c 703 g 421 t 



ORIGIN 

1 agctttctgg ggcaggccag gcctgacttt ggctgggggc agggaggggg ctaaggtgac 

61 gcaggtggcg ccagccaggc gcacacccaa tgcccgtgag cccagacact ggaccctgcc 

121 tggaccctcg tggatagaca agaaccgagg ggcctctgcg ccctgggccc agctctgtcc 

181 cacaccgcag tcacatggcg ccatctctct tgcagcttcc accaagggcc catcggtctt 

241 ccccctggcg ccctgctcca ggagcacctc tgggggcaca gcggccctgg gctgcctggt 

3 01 caaggactac ttccccgaac cggtgacggt gtcgtggaac tcaggcgccc tgaccagcgg 

361 cgtgcacacc ttcccggctg tcctacagtc ctcaggactc tactccctca gcagcgtggt 

421 gaccgtgccc tccagcagct tgggcaccca gacctacacc tgcaacgtga atcacaagcc 

481 cagcaacacc aaggtggaca agagagttgg tgagaggcca gcgcagggag ggagggtgtc 

541 tgctggaagc caggctcagc cctcctgcct ggacgcatcc cggctgtgca gtcccagccc 

601 agggcagcaa ggcaggcccc gtctgactcc tcacccggag cctctgcccg ccccactcat 

661 gctcagggag agggtcttct ggctttttcc accaggctcc gggcaggcac aggctggatg 

721 cccctacccc aggcccttca cacacagggg caggtgctgc gctcagagct gccaaaagcc 

781 atatccagga ggaccctgcc cctgacctaa gcccacccca aaggccaaac tctctactca 

841 ctcagctcag acaccttctc tcttcccaga tctgagtaac tcccaatctt ctctctgcag 

901 agctcaaaac cccacttggt gacacaactc acacatgccc acggtgccca ggtaagccag 

961 cccaggactc gccctccagc tcaaggcggg acaagagccc tagagtggcc tgagtccagg 



WO 03/064992 PCT/US02/34197 

23/91 



FIGURE 7P 

agcagggtgc tgacgcatcc acctccatcc cagatccccg taactcccaa 
gcagagccca aatcttgtga cacacctccc ccgtgcccac ggtgcccagg 
caggcctcac cctccagctc aaggcaggac aagagcccta gagtggcctg 
caggccccag cagggtgctg acgcgtccac ctccacccca gatccccgta 
ttctctctgc agagcccaaa tcttgtgaca cacctccccc . atgcccacgg 
agccagccca ggcctcgccc tccagctcaa ggcgggacaa gagccctaga 
tccagggaca ggccccagca gggtgctgac gcatccacct ccatcccaga 
tcccaatctt ctctctgcag agcccaaatc ttgtgacaca cctcccccgt 
cccaggcaag ccagcccagg cctcgccctc cagctcaagg caggacaggt 
ggcctgcatc cagggacagg tcccagtcgg gtgctgacac atctgcctcc 
cagcacctga actcctggga ggaccgtcag tcttcctctt ccccccaaaa 
cccttatgat ttcccggacc cctgaggtca cgtgcgtggt ggtggacgtg 
accccgaggt ccagttcaag tggtacgtgg acggcgtgga ggtgcataat 
agccgcggga ggagcagtac aacagcacgt tccgtgtggt cagcgtcctc 
accaggactg gctgaacggc aaggagtaca agtgcaaggt ctccaacaaa 
cccccatcga gaaaaccatc tccaaaacca aaggtgggac ccgcggggta 
atggacagag gccagcttga cccaccctct gccctgggag tgaccgctgt 
gtccctacag gacagccccg agaaccacag gtgtacaccc tgcccccatc 
atgaccaaga accaggtcag cctgacctgc ctggtcaaag gcttctaccc 
gccgtggagt gggagagcag cgggcagccg gagaacaact acaacaccac 
ctggactccg acggctcctt cttcctctac agcaagccca ccgtggacaa 
cagcagggga acatcttctc atgctccgtg atgcatgagg ctctgcacaa 
cagaagagcc tctccctgtc tccgggtaaa tgagtgcgac agccggcaag 
ccgggctctc ggggtcgcgc gaggatgctt ggcacgtacc ccgtgtacat 
cacccagcat ggaaataaag cacccagcgc tgccctgggc ccctgtgaga 
tctttccacg ggtcaggccg agtctgaggc ctgagtgaca tgagggaggc 

[SEQUENCE ID NO:S6] 

VI. HUMAN IG GAMMA-4 CHAIN C REGION - HOMO SAPIENS (HUMAN). 

AMINO ACID SEQUENCE . 

>spjP0186l|GC4_HUMAN IG GAMMA-4 CHAIN C REGION - Homo sapiens (Human). 



10 


20- 


30 


40 


50 


60 


1 

ASTKGPSVFP 


1 

LAPCSRSTSE 


1 

■STAALGCLVK 


1 

DYFPEPVTVS 


1 

WNSGAXtTSGV 


1 

HTFPAVLQSS 


70 


80 


90 


100 


110 


120 


1 

GLYSLSSWT 


! 

VPSSSLGTKT 


1 

YTCNVDHKPS 


i 

NTKVDKRVES 


1 

KYGPPCPSCP 


1 

APEFLGGPSV 


130 


140 


150 


160 


170 


180 


i 

FLFPPKPKDT 


1 

LMISRTPEVT 


1 

CVWDVSQED 


1 

PEVQFNWYVD 


1 

GVEVHNAJCTK 


1 

PREEQFNSTY 


190 


200 


210 


220 


230 
1 

GQPREPQVYT 


240 


I 

RWSVLTVLH 


1 

QDWLNGKEYK 


I 

CKVSNKGLPS 


! 

SIEKTISKAK 


! 

LPPSOEEMTK 


250 


260 


270 


280 


290 


300 


1 

NQVSLTCLVK 


i 

GFYPSDIAVE 


1 

WESNGQPENN 


1 

YKTTPPVLDS 


1 

DGSFFLYSRL 


i 

TVDKSRWQEG 



1021 gacaggcccc 
1081 tcttctctct 
1141 taagccagcc 

12 01 agtccaggga 
1261 actcccaatc 
1321 tgcccaggta 

13 81 gcggcctgag 
1441 tccccgtaac 
1501 gcccaaggtg 
1561 gccctagagt 
1621 atctcttcct 
1681 cccaaggata 
1741 agccacgaag 
1801 gccaagacaa 
1861 accgtcctgc 
1921 gccctcccag 
1981 tgagggccac 
2041 gccaacctct 
2101 ccgggaggag 
2161 cagcgacatc 
2221 gcctcccatg 
22 81 gagcaggtgg 
2341 ccgcttcacg 
2401 cccccgctcc 
24 61 acttcccggg 
2521 ctgtgatggt 
2 5 81 agagcgggt c 
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FIGURE 7Q 



310 320 327 

I I ! 

NVFSCSVMHE ALHNHYTQKS LSLSLGK 



[SEQUENCE ID NO: 2 6] 



CODING SEQUENCE 



cttccaccaa 
gcacagccgc 
ggaacccagg 
gactctactc 
acacctgcaa 
aatatggtcc 
tcctgccccc 
gcgtggcggc 
gcgtggaggt 
gtgtggtcag 
gcaaggtctc 
ggcagccccg 
accaggtcag 
gggagagcaa 
acggctcctt 
atgtcttccc 
tctccctgtc 



gggcccatcc 
cctgggctgc 
cgccctgacc 
cctcagcagc 
cgtagaccac 
cccatgccca 
cccaaaaccc 
ggacgtgagc 
gcataatgcc 
cgtccccacc 
caacaaaggc 
agagccacag 
cctgacctgc 
tgggcagccg 
cttcctctac 
atgctccgtg 
tctgggtaaa 



gtcttccccc 
ctggccaagg 
agcggcgtgc 
gtggtgaccg 
aagcccagca 
tcatgcccag 
aaggacactc 
caggaagacc 
aagacaaagc 
gccctgcacc 
ctcccgtcct 
gtgtacaccc 
ctggtcaaag 
gagaacaacc 
agcaggctaa 
atgcatgagg 
tga 



tggcgccctg 
accacttccc 
acaccttccc 
tgccctccag 
acaccaaggt 
cacctgagtt 
tcatgatctc 
ccgaggtcca 
cgcgggagga 
aggactggct 
ccatcgagaa 
tgcccccatc 
gcttctaccc 
acaagaccac 
ccgtggacaa 
ctctgcacaa 



ctccaggagc 
cgaaccggtg 
ggctgtccta 
cagcttgggc 
ggacaagaga 
cctgggggga 
ccggacccct 
gttcaactgg 
gcagttcaac 
gaacggcaag 
aaccatctcc 
ccaggaggag 
cagcgacatc 
gcctcccgtg 
gagcaggcgg 
ccaccacaca 



acctccgaga 
acggtgtcgt 
cagtcctcag 
acgaagacct 
gccgagtcca 
ccatcagtct 
gaggtcacgt 
cacgtggatg 
agcacgtacc 
gagtacaagt 
aaagccaaag 
atgaccaaga 
gccgtggagt 
ctggactccg 
caggagggga 
cagaagagcc 

[SEQUENCE ID 



-1 
60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
640 
900 
960 
983 
NO:2S] 



GenBank 

K01316. Human Ig germline 

LOCUS HUMIGCD2 20 28 bp DNA 

DEFINITION Human Ig germline H-chain G- 
3 • end . 
K01316 
K01316 .1 



PRI 

■E-A region B: gamma- 



ll-APR-2001 
constant region, 



ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 
MEDLINE 

REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 
TITLE 



JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
PUBMED 



Chordata; Craniaca; Vertebrata,- 
Primaces; Cacarrhini; Hominidae, 



Euteleostomi ; 
Homo . 



GI:184751 

human . 

Homo sapiens 
Eukaryota; Metazoa,- 
Mammalia; Eutheria; 

1 (bases 1 to 2028) 

Ellison, J., Buxbaum,J. and Hood.L. 

Nucleotide sequence of a human immunoglobulin C gamma 4 gene 

DNA 1 (1) , 11-18 (1981) 

83157104 

2 (bases 475 to 1069; 1180 to 1331; 1432 to 1655) 

Takahashi.N. , Ueda.S., Obata.M., Nikaido,T., Nakai,S. and Honjo.T. 

Structure of human immunoglobulin gamma genes: implications for 

evolution of a gene family 

Cell 29 (2), 671-679 (1982) 

83001943 

6811139 

3 (bases 894 to 1106) 
Krawinkel,U. and Rabbi tts.T.H. 

Comparison of the hinge-coding segments in human immunoglobulin 
gamma heavy chain genes and the linkage- of the gamma 2 and gamma 4 
subclass genes 

EMBO J. 1 (4) , 403-407 (1982) 

84235992 

6329676 

4 (bases 1 to 20 28) 
Ellison, J. and Hood,L. 

Linkage and sequence homology of two human immunoglobulin gamma 
heavy chain constant region genes 

ProC Natl. Acad. Sci . U.S.A. 79 (6), 1984-1988 (1982) 

B2197621 

6804948 
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[1] reports that the human C-gamma-4 gene is equally homologous to 
Che mouse gamma-1, gamma-2a, and gamma-2b genes . (about 75%). [3] 
also reports partial sequences for human gamma-2, gamma-3, and a 
gamma pseudogene. [2} presents the gamma-l, gamma-2, gamma-3, and 
pseudo-gamma hinge regions . 

This entry is part of a multigene region (region B) , which includes 
the gamma-2, gamma-4, epsilon-1, and alpha-2 genes. See segment 1 
for more comments. 
Complete source information: 

Human fetal liver DNA, library of T. Maniati3 [3 J and Lawn et al 



CI] . (2) ; 



FEATURES 

source 



gene 



.1 [3], and Ig-gamma-4-2 (2], 



intron 



CDS 



421 a 



clones 24B (lj , lambda-HG4 . 
Location/ Qualifiers 
1..2028 

/organism- "Homo sapiens" 

/db_xref«"t axon: 9606" 

/map-»14q32 .33" 

/germline 

<1..2028 

/gene«=»IgH" 

/note="IGHG4 " 

<1- .215 

/gene«"IgH" 

/note« "gamma-4 intron J-C" 

join{<216. .509,900. .935,1054. .1383,1481. .1803) 

/gene="IgH" 

/codon_start-3 

/product- "immunoglobulin gamma-4 heavy chain" 
/protein_id«"AAB593 94 . 1" 
/db_xrefa"GI : 184759" 

/translation*. "STKGPSVFPLAPCSRSTSESTAALGCLVIODYFPEPVTVSWNSGA 
LTSGVHTFPAVI^SSGLySLSSWTVPSSSLGTKTYTCNVDHKPSNTKVDKRVESKYG 
PPCPSCPAPEFLGGPSVFLFPPK^KDTU^ISRTPETVTCVWDVSQEDPEVQFNWYVDG 
VEVHNAKTKPREEQFNSTYRWSVLTVLHQDWLNGKEYKCKVSNKGLPSSIEKTISKA 
KGQPREPQVYTLPPSQEEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPV 
LDSDGS FFLYSRLTVDKSRWQEGNVFSCS VMHEALHNHYTQKS LSLSLGK • 
216 . .509 
/gene»"IgH" 
/note* "GOO- 119-340" 
510. .899 
/gene-*' IgH" 
900. .935 
/gene- "IgH" 
536. .1053 
/gene- "IgH" 
1054.. 1383 
/gene-" IgH" 
1384 . . 1480 
/gene» "IgH" 
1481. .1803 
/gene- "IgH" 
709 C 



567 g 



331 t 



BASE COUNT 
ORIGIN 

1 agctttctgg ggcaggccgg gcctgacttt ggctgggggc agggaggggg ctaaggtgac 
61 gcaggtggcg ccagccaggt gcacacccaa tgcccatgag cccagacact ggaccctgca 
121 tggaccatcg cggatagaca agaaccgagg ggcctctgcg ccctgggccc agctccgtcc 
181 cacaccgcgg tcacatggca ccacctctct tgcagcttcc accaagggcc catccgtctt 
241 ccccctggcg ccctgctcca ggagcacctc cgagagcaca gccgccctgg gctgcctggt 
3 01 caaggactac ttccccgaac cggtgacggt gtcgtggaac tcaggcgccc tgaccagcgg 
361 cgtgcacacc ttcccggctg tcctacagtc ctcaggactc tactccctca gcagcgcggt 
421 gaccgtgccc tccagcagc't tgggcacgaa gacctacacc tgcaacgtag atcacaagcc 
481 cagcaacacc aaggcggaca agagagctgg tgagaggcca gcacagggag ggagggtgcc 
541 tgctggaagc caggctcagc ccccccgcct ggacgcaccc cggctgtgca gccccagccc 
601 agggcagcaa ggcatgcccc atctgtctcc tcacccggag gcctctgacc accccactca 
661 tgctcaggga gagggtcttc tggatttttc caccaggctc ccggcaccac aggccggatg 
721 cccctacccc aggccctgcg catacagggc aggtgctgcg ctcagacccg ccaagagcca 
781 tatccgggag gaccctgccc ctgacctaag cccaccccaa aggccaaact ctccactccc 
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841 tcagctcaga caccetctct ccccccagac ctgagtaact cccaatcttc ectctgcaga 

901 gtccaaatac ggtcccccat gcccaccatg cccaggcaag ccaacccagg cctcgccctc 

961 cagctcaagg cgggacaggt gccctagagt agcctgcacc cagggacagg ccccagccgg 

1021 gtgctgacgc acccaccncc atctcctcct cagcacctga gttcccgggg ggaccaccag 

1081 tcttcctgtt ccccccaaaa cccaaggaca ctctcatgac ctcccggacc cctgaggcca 

1141 cgtgcgtggt ggtggacgtg agccaggaag accccgaggt ccagttcaac eggtacgegg 

1201 atggcgtgga ggtgcacaat gccaagacaa agccgcggga ggagcagt tc ' aacagcacgt 

1261 accgtgtggc cagcgtcctc accgccctgc accaggactg gctgaacggc aaggagcaca 

1321 agtgcaaggt ctccaacaaa ggcctcccgt cccccatcga gaaaaccatc tccaaagcca 

13 81 aaggtgggac ccacggggtg cgagggccac acggacagag . gccagcccgg cccacccccc 

1441 gccctgggag tgaccgctgc gccaacctct gtccctacag ggcagccccg agagccacag 

1S01 gtgtacaccc tgcccccatc ccaggaggag atgaccaaga accaggtcag cctgacctgc 

1561 ctggtcaaag gcttctaccc cagcgacacc gccgtggagc gggagagcaa cgggcagccg 

1621 gagaacaact acaagaccac gcctcccgtg ctggactccg acggctcctt cttcctctac 

1681 agcaggccaa ccgtggacaa gagcaggtgg caggagggga atgtcttctc acgccccgtg 

1741 atgcatgagg ccccgcacaa ccaccacaca cagaagagcc tctccctgcc cctgggcaaa 

1801 tgagtgccag ggccggcaag cccccgctcc ccgggctctc ggggccgcgc gaggatgctt 

1861 ggcacgtacc ccgtctacat acctcccagg cacccagcat ggaaataaag cacccaccac 

1921 tgccctgggc gcctgtgaga ctgtgatggt tctttccacg ggccaggccg agtctgaggc 

1981 ccgagcgaca cgagggaggc agagcgggtc ccaccgtccc cacactgg 

[SEQUENCE ID NO:57] 

VII. HUMAN IG DELTA CHAIN C REGION - HOMO SAPIENS (HUMAN). 

AMINO ACID SEQUENCE 

>sp | P018 8 0 j DTC__HUMAN IG DELTA CHAIN C REGION - Homo sapiens (Human) . 

10 20 30 40 50 60 

I I I " I I I 

APTKAPDVFP IISGCRHPKD NSPWLACLI TGYHPTSVTV TWYMGTQSQP QRTFPEIQRR 

70 80 90 100 110 120 

i I i l i I . 

DSYYMTSSQL STPLQQWRQG EYKCWQHTA SKSKKEIFRW PESPKAQASS VPTAQPQAEG 

130 140 150 160 . 170 180 

i ! I ! I I 

SLAKATTAPA TTRNTGRGGE EKKKEKEKEE QEERETKTPE CPSHTQPLGV YLLTPAVQDL 

190 200 210 220 230 240 

I I I I I I 

WLRDKATFTC FWGSDLKDA HLTWEVAGKV PTGGVEEGLL ERHSNGSQSQ HSRLTLPRSL " 

250 260 270 280 290 300 

I ) I I I I 

WNAGTSVTCT LNHPSLPPQR LMALREPAAQ APVKLSLNLL ASSDPPEAAS WLLCEVSGFS 

310 320 330 340 350 360 

I II I I ! 

PPNILLMWLE DQREVNTSGF APARPPPQPG STTFWAWSVL RVPAPPSPQP ATYTCWSHE 
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370 380 

I I 
DSRTLLNASR SLEVSYVTDH GPM 



[SEQUENCE ID NO: 28] 



GenBan.lt 

K02876. Human germline IgD. [gi : 1847663 PubMed, Protein, Related Sequences. 
Taxonomy, OMIM, LinkOut 



PRI 08 -NOV- 1994 

C-region, first hinge domain. 



LOCUS HUMIGCH02 3 00 bp DNA 

DEFINITION Human germline IgD-chain gene, 
ACCESSION K028 76 
VERSION K02876.1 GI: 184766 

KEYWORDS C-region; germline; hinge exon; immunoglobulin heavy chain; 

immunog 1 obu 1 in - de 1 t a . 
SOURCE Homo sapiens ( individual_isolate Chronic lymphocytic leukemia <CLL) 

patient) DNA. 
ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria,- Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 Co 300) 

AUTHORS White, M.B.. Shen.A.L.. Word .C.J. , Tucker, P. W. and Blattner , F.R. 
TITLE Human immunoglobulin D: genomic sequence of the delta heavy chain 

JOURNAL Science 228 (4700), 733-737 (1985) 
MEDLINE 8519252 2 
COMMENT See segment 1. 

FEATURES Location/Qualifiers 
source 1 . .300 

/organisms "Homo sapiens" 

/isolate-"Chronic lymphocytic leukemia (CLL) patient" 

/db_xref -"taxon: 9606" 

/map-"14q32 . 33 " 

/cell_type«" lymphocyte" 

/germline 
intron <1..151 

/gene-"IGHD" 

/note- "GOO -120 -084" 

/number- 1 
exon 101 . .202 

/partial 

/gene»"IGHD" 

/notes -hinge- 1 domain; GOO-120-084" 
/number»2 
intron 203 . . >300 

/gene-"IGHD" 

/ note- "G00 - 12 0 - 084 " 

/number»2 

gene join (K02 875 . 1 : 1 . .495,1. . 300 , K02B77 . 1 : 1 . .300, 

K02 878.1:l. . S00 , K02879 . 1 : 1 . . 500 , K02880 . 1 : 1 . .100, 

K02881.1:l. .200.K02882.1:1. .52) 

/gene«"IGHD" 

CDS join (K02875. 1:101. .403,101. . 202 , K02877 . 1 : 101 . .172. 

K02 878. 1:101. . 424 , K02 879 . 1 : 101 . . 424 , K02881 . 1 : 25 . .182. 

K02882.1:44. .52) 

/partial 

/gene="lGHD" 

/note- "membrane bound form" 
/codon_start -3 

/product-" immunoglobulin delta-chain" 
/protein_id- ■ AAAS2 771 . 1 » 
/db_xref-"GI : 4 95872" 
/db_xref-"GDB:G00-120-0B4" 

/ translation-" PTKAPDVFPIISGCRHPKDNSPWLACLITGyHPTSVTVTWYMG 
TQSQ PQRTF P E I QRRDS YYMTS SQLST P LQQWRQGEYKCWQHTAS KS KKE I FRW PE S 
P KAQAS S VPTAQ PQAEGS LAKATTAP ATTRNTGRGGE E KKKEKE KE EQE ERETKT P EC 
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PSHTQPLGVYLLTPAVQDLWLRDKATFTCFWGSDLKDAHLTWEVAGKVPTGGVEEGL 
LERHSNGSQSQHSRLTLPRSLWNAGTSVTCTLNHPSLPPQRLMAIJIEPAAQAPVKLSL 
NLLASSDPPEAASWLLCEVSGFS P PNI LLMWLEDQREVNTSGFAPARPP PQPRSTTFW 
AWSVLRVPAPPSPQPATYTCWSHEDSRTLLNASRSLEVSYLAMTPLIPQSKDENSDD 
YTTFDDVGS LWTTLS T FVALF I LTLL YSGI VT FIKVK" 

[SEQUENCE ID NO: 3 0] 

CDS . join{K02875. 1:101. .403,101. .202, K02877. 1:101. .172, 

K02878 .1 : 101. . 4 24 , K02879 . 1 : 101 . . 424 . K0288O . 1 : 2 5 . .53) 

/partial 

/gene»"IGHD" 

/note-" secreted form" 

/codon_start-3 ' 

/product= B immunoglobulin delta-chain" 
/protein_id= "AAA52770 .1" 
/db_xref-"GI :495871" 
/db_xref="GDB:G00-120-084" 

/translation-" PTKAPDV FPU SGCRHP KDNS PWLACL I TG YHPTS VTVTWYMG 
TQSQPQRTFPEIQRRDSYYMTSSQLSTPLQQWRQGEYKCWQHTASKSKKEIFRWPES 
P KAQ AS SVPTAQ PQ AEGS LAKATT AP ATTRNTGRGGEEKKKEKE KEEQE ERETKTPE C 
PSHTQ P LGVYLLT P AVQDLWLRDKAT FTCFWGSD LKDAHLTW EVAGKVPTGGVEEGL 
LERHSNGSQS QHS RLTL P RS LWNAGTS VTCTLNH P S LP PQRLMALRE P AAQAP VKLS L 
NLLASSDPPEAASWLLCEVSGFS PPNI LLMWLEDQREVNTSGFAPARPP PQPRSTTFW 
AWSVLRVPAP PS PQPATYTCWSHEDSRTLLNASRSLEVS YVTDHGPMK" 
BASE COUNT 59 a 133 C 52 g 56 t 

ORIGIN About 300 bp after segment 1? 118 bp upstream of Stul site. 

1 taggctgcct gtgcccccca cctgcctgtc cacaacccag cctctggtac atccatgccc 
61 tctgccctaa gcctcacctg cacttttcct tggatttcag agtctccaaa ggcacaggcc 
121 tcctccgtgc ccactgcaca accccaagca gagggcagcc tcgccaaggc aaccacagcc 
161 ccagccacca cccgtaacac aggtgagaag ccccttccct gcacactcca cccccaccca 
241 cctgctcatt cctcagccgc ctcctccagg cagcccttca taactccttg cctgagtctc 

[SEQUENCE ID NO: 27] 

K02877. Human Ig germline ... [gi : 184767] PubMed, Protein, Related Sequences, 
Taxonomy, OMIM, LinkOut 

LOCUS HUMIGCH03 300 bp DNA PRI OB-NOV-1994 

DEFINITION Human Ig germline delta H-chain C-region gene, second hinge domain 

(CLL lymphocyte) . 
ACCESSION K02877 
VERSION K02877.1 GI : 184767 

KEYWORDS C- region; germline; hinge exon; immunoglobulin heavy chain,- 

immunoglobulin-delta . 
SOURCE Homo sapiens ( individual_isolate Chronic lymphocytic leukemia (CLL) 

patient) DNA. . 
ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria,- Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 3 00) 

AUTHORS White, M.B., Shen,A.L.. Word, C J., Tucker, P. w. and Blattner. F.R. 
TITLE Human immunoglobulin D: genomic sequence of the delta heavy chain 

JOURNAL Science 228 (4700), 733-737 (1985) 
MEDLINE 85192522 
COMMENT See segment 1 . 

FEATURES Location/Qualifiers 
source 1. .3 00 

/organism- "Homo sapiens" 

/isolate- "Chronic lymphocytic leukemia (CLL) patient" 

/db_xref «"taxon: 9606" 

/map«"l4q3 2 .33" 

/cell_type« " lymphocyte" 

/germline 
intron <1..100 

/gene- " IGHD" 

/note- "G00 -120- 084 • 

/number«2 
exon 101.. 172 
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/gene- M IGHD"' 

/note- "hinge -2 domain; GOO -120- 084; putative" 
/number^ 3 
intron 173 . . >300 

/gene«"IGHD" 
/note»"G00-120-064'* 
/number^ 3 

gene join (K02B75 . 1 : 1 . .495,K02876.1:1. .300,1. .300, 

K02878.1:l. . 5 00,K02 879.1:1. . 500 , K02B80 . 1 : 1 . .100, 

K0288 1.1:1. . 200 , K02 882 . 1 : 1 . .52) 

/gene«"IGHD« 

CDS join (K0287S. 1:101. . 4 03 . K02 87S . 1 : 101 . .202,101. .172, 

K028 78.1 : 101 . . 4 24 , K02 879 . 1 : 101 . . 42 4 , K02B81 . 1 : 25 . .182. 

K02882.1:44. .52) 

/partial 

/gene»"IGHD" 

/ not e» "membrane bound form" 
/codon_scart=3 

/product*" immunoglobulin delta-chain" 
/protein_id="AAA52 771 . 1" 
/db_xref-"GI : 4 958 72" 
/ db_xr e f - " GDB : GO 0 - 12 0 - 0 8 4 " 

/translations " PTKAPDVFPIISGCRHPKDNSPVVTACLITGYHPTSVTVTWYMG 
TQSQPQRTFPEIQRRDSYYMTSSQLSTPLQQWRQGEYKCVVQHTASKSKKEIFRWPES 
P KAQ AS S VPTAQ PQAEGS LAKATT AP ATTRNTGRGGEE KKKE KE KEEQE ERETKT P E C 
PSHTQPLGVYLLTPAVQDLWLRDKATFTCFVVGSDLKDAHLTWEVAGICVPTGGVEEGL 
LERHSNGSQSQHSRI-TLPRSLWNAGTSVTCTIiNHPSLPPQRLMALREPAAQAPVKLSL 
NLLASSDPPEAAS WLLCEVSGFS P PNI LLMWLEDQREVNTSGFAPARPPPQPRSTT FW 
AWSVLRVPAPPSPQPATYTCWSHEDSRTLLNASRSLEVSYLAMTPLIPQSKDENSDD 
YTTFDDVGSLWTTLSTFVALFILTLLYSGIVTFIKVK" 
CDS join(K02875. 1:101. . 403, K02876. 1:101.. 202, 101.. 172, 

K02878 .1 :101 . . 424 , K02879 . 1 : 101 . . 4 24 , K02880 . 1 : 25 . .53) 

/partial 

/gene«"IGHD" 

/note»"secreted form" 

/codon_scart»3 

/products "immunoglobulin delta -chain" 
/protein_id="AAA52 770 .1" 
/db_xref«="GI : 495871" 
/db_xref="GDB:G0O-l2O-O84" 

/ C rans la t ion- " PTKAPDVFP 1 1 SGCRHPKDNS PWLACLI TGYHPTSVTVTWYMG 
■TQSQPQRTFPEIORRDSYYMTSSQLSTPLQQWRQGEYKCWQHTASKSKKEIFRWPES 
PKAQASSVPTAQPQAEGSIAKATTAPATTRNTGRGGEEKKKEKEKEEQEERETKTPEC 
PS HTQ P LGVYLLT PAVQDLWLRD KAT FTCFWGS DLKDAHLTWEVAGKVPTGGVEEGL 
LERHSNGSQSQHS RLTL PRS LWNAGTS VTCTLNHP S L P PQRLMALRE P AAQAPVKLS L 
NLIJ^SDPPEAASWLLCEVSGFSPPNILIJWLEDQREVNTSGFAPARPPPQPRSTTFW 
AWSVLRVPAPPSPQPATYTCWSHEDSRTLLNASRSLEVSYVTDHGPMK" 
BASE COUNT 102 a 52 c 70 g 76 C 

ORIGIN About 1.85 kb after segment 2. 

1 gtcattagct ggatttagcc attccacaat gtacacatat ttcaaacatt gtgttgtaca 
61 tgataaacac gtataatttt cgtcaactaa aaatttttag gaagaggagg agaagagaag 
121 aagaaggaga aggagaaaga ggaacaagaa gagagagaga caaagacacc aggctttttc 
181 tgacccctgg gctatcaaaa cacctattgc ccaataacta gccggccgtt ggtgccctaa 
241 actactgaag cgattgctgt tatgtggatg ggccccggac acctagaaac tcgtgacccc 

[SEQUENCE ID NO: 29] 
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K02878. Human germline IgD. [gi : 184768] PubMed, Protein. Related Sequences, 
Taxonomy, OMIM, LinkOut 



LOCUS 

DEFINITION 
ACCESSION 
VERSION 
KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 
MEDLINE 

COMMENT 

FEATURES 

source 



PRI 08-NOV-1994 
C-delta-2 domain. 



intron 



intron 



intron 



gene 



CDS 



HUMIGCH04 500 bp DNA 

Human germline IgD chain gene, C-region 
K02878 

K02B78.1 GI:184768 

C- region; germline; immunoglobulin heavy chain; 
immunoglobulin-delta . 

Homo sapiens ( individual_isolace Chronic lymphocytic leukemia (CLL) 
patient) DNA. 
Homo sapiens 

Eukaryota,- Metafcoa,- Chordata; Craniata,- Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae,* Homo. 
1 (bases 1 co 500) 

White, M.B., Shen.A.L., Word. C. J.. Tucker, P. W. and Blattner , F .R . 

Human immunoglobulin D: genomic sequence of the delta heavy chain 

Science 228 (4700), 733-737 (1985) 

85192522 

See segmenc 1 . 

Locat ion/Qual i f ie r s 

1. .500 

/organism- "Homo sapiens" 

/isolate-"Chronic lymphocytic leukemia (CLL) patient" 

/db_xref-"taxon: 9606" 

/map»"14q32 .33 " 

/cell_cype» • lymphocyte ■ 

/ germline 

<1. .100 

/gene-"IGHD" 

/noceo "G00- 12 0-084 " 

/number- 3 

101 . .424 
/gene="IGHD" 

/noce»"C-delca-2 domain; GOO-120-084" 
/number- 4 
42S . .>500 
/genet* "IGHD" 

/noce-"G00-120-0B4" .* 
/number" 4 
425 . . >500 
/gene-" IGHD" 
/noce-"IgD-s intron D" 
join (K02 875 .1:1 . .495 , K028 76 .1:1. 
1. .500.K02B79. 1 :1 . . 500 , K02880 . 1 ; 
K02882 .1 :1. .52) 
/gene- "IGHD" 

j Oin ( K02 875. 1 : 101. .403. K02876 . 1:101. .202, 
K02877 .1 :101. .172.101. . 424 , K02 879 . 1 : 101 . .424, 
K02881.1:25. . 182 , K02882 . 1 : 44 . .52) 
/partial 
/gene-" IGHD" 

/note- "membrane bound form" 
/codon_starc»3 

/product - " immunoglobul in del t a - chain " 
/protein_id-"AAAS2 77l . l" 
/db_xref-"GI :4 95872" 
/db~xref-"GDB: GOO -12 0-084" 

/ 1 rans 1 a t ion- • PTKAPDVFP 1 1 SGCRHP KDNS PVVLACLITGYHPTSVTVTWYmg 
TOSQPQRTFPEIQRRDSYYMTSSQLSTPLQQWRQGEYKCWQHTASKSKKEIFRWPES 
PKAQASSVPTAQPQAEGSLAKATTAPATTRNTGRGGEEKKKEKEKEEOEERETKTPEC 
PSHTQPLGVTLLTPAVQDLWLRDKATFTCFVVGSDLKDAHLTWEVAGKVPTGGVEEGL 
LERHSNGSQS QHS RLTL P RS LWN AGTS VTCTLNH P S LP PQRLMALRE P AAQAP VKLSL 
NLLASSDPPEAASWl^LCEVSGFSPPNILLMWLEDQREVNTSGFAPARPPPOPRSTTFW 
AWSVLRVPAPPSPQPATYTCWSHEDSRTLLNASRSLEVSYLAMTPLIPQSKDENSDD 
YTT FDDVGS L WTTLST FVAL Fl LTLLY SG I VT F I KVK " 



.300,K02877.1:1. .300, 
1. .100.K028Bl.lil. .200, 
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CDS join(K0287S. 1:101. . 403 , K02876 . 1 : 101 . .202, 

K02877. 1:101. .172,101. . 424 , K02 8 79 . 1 : 101 . .424, 

K02860 .1:25. .53) 

/partial 

/gene-"IGHD" 

/note*. "secreted form" 

/codon_start»3 

/product =" immunoglobulin delta -chain" 
/protein_id-"AAAS2 770. 1" 
/db_xref «"GI : 4 958 71" 
/db_xref»"GDB:G00-120-084 B 

/ 1 rans la t ion- - PTKAPDVFP 1 1 SGCRHPKDNS PWLACLI TGYHPTSVTVTWYMG 
TQS QPQRTF P E I Q RRDS YYMTS SQLS T P LQQWRQGE YKCWQHTAS KS KKE I FRW PES 
PKAQASSVPTAQPQAEGSLAKATTAPATTRNTGRGGEEKKKEKEKEEQEERETKTPEC 
PSHTQPLGVYH.TPAVQDLWLRDKATFTCFWGSDLKDAHLTWEVAGKVPTGGVEEGL 
LE RHSNGSQS QHS RLTLP RS LWNAGTS VTCTLNHPS L P PQRLMALRE PAAQAPVKLS L 
NLLAS SD P P E AAS WLLCE VSGFS P PN I LLMWLEDQREVNTSGFAP ARP P PQ P RS TT FW 
AWSVLRVPAP PS PQPATYTCWSHEDSRTLLNASRSLEVSYVTDHGPMK" 
BASE COUNT 93 a 171 c 157 g 79 t 

ORIGIN About 4 50 bp after segment 3; 131 bp upstream of AccI site. 

1 gaagctgggg agaggagagc acagtggtta agtcagtccc tgcagcccaa ctgctcccga 
61 aggtccggcc acagctgctc tcgtttgctc tcccctgcag agtgtccgag ccacacccag 
121 cctcttggcg tctacctgct aacccctgca gtgcaggacc tgtggctccg ggacaaagcc 
181 accttcacct gcttcgtggt gggcagtgac ctgaaggatg ctcacctgac ctgggaggtg 
241 gctgggaagg tccccacagg gggcgtggag gaagggctgc tggagcggca cagcaacggc 
3 01 tcccagagcc agcacagccg tctgaccctg cccaggtcct tgtggaacgc ggggacctcc 
361 gtcacctgca cactgaacca tcccagcctc ccaccccaga ggttgatggc gctgagagaa 
421 cccggtgagc ctggctccca ggtggggaga cgagggtgcc cacagcctgc tgacccctac 
481 gcccgcccca gggccatgac 

[SEQUENCE ID NO: 3 0] 

K02879. Human Ig germline ... [gi : 184769] PubMed, Protein, Related Sequences, 
Taxonomy, OMIM, LinkOut 

LOCUS HUMIGCH05 500 bp DNA PRI 08 -NOV- 19 94 

DEFINITION Human Ig germline delta H-chain C-region gene, C-delta-3 domain 

( CLL lymphocyte) . 
ACCESSION K02B79 
VERSION K02879.1 GI: 184769 

KEYWORDS C-region; germline; immunoglobulin heavy chain; 

immunoglobulin- delta . 
SOURCE Homo sapiens (individual_isolate Chronic lymphocytic leukemia (CLL) 

patient) DNA. 
ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 500) 

AUTHORS White, M.B., Shen.A.L.. word.C.J., Tucker, P. W. and Blattner, F.R. 
TITLE Human immunoglobulin D: genomic sequence of the delta heavy chain 

JOURNAL Science 228 (4700), 733-737 (198S) 
MEDLINE 85192522 
COMMENT See segment 1. 

FEATURES Location/Qualifiers 
source 1. .500 

/organism- "Homo sapiens" 

/isolate- "Chronic lymphocytic leukemia (CLL) patient" 

/ db_xr e f « " t axon : 9 6 0 6 " 

/map-"14q32 .33" 

/cell_type» "lymphocyte" 

/germline 
intron <1 . . 100 

/gene-"IGHD" 

/note- "GO 0-120- 084" 

/number»4 
exon 101.. 424 

/gene-"IGHD" 
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/noteo-c-delta-3 domain; G00-120-084; putative" 
/ number- 5 
incron 425 . - >500 

/gene- - IGHD" 

/note="G00-120-084" 

/number»5 

gene join (K0287S . 1 : 1 . . 4 95 , K02876 . 1 : 1 . . 3 00 , K02877 . 1 : 1 . .300, 

K02878.1:l. .500, 1. . 500 , K02 8 80 . 1 : 1 . . 100 , K02881 . 1 : 1 . .200, 
K02882.1:l. .52) 
/gene- "IGHD" 

CDS join{K02875. 1:101. .403, K02876. 1:101. -202, 

K02877. 1:101. . 1 72 . K02878 . 1 : 101 . .424,101. .424, 
K02881\l :25. .182,K02882 .1 :44 . .52) 
/partial 
/geneo " IGHD" 

/note- "membrane bound form" 
/codon_start»3 

/product* "immunoglobulin delta -chain" 
/protein_id-"AAA5277l .1" 
/db_xref-"GI :495872" 
/db_xref-"GDB:G00-120-084" 

/translation-" PTKAPDV FPUS GCRH P KDNS PWLACLI TGYHPTS VTVTWYMG ' 
TQSQPQRTFPEI GRRDS YYMTS SQLS T P LQQWRQGE YKCVVQHT AS KSKKEI FRWpES 
PKAQASSVPTAQPQAEGSLAKATTAPATTRNTGRGGEEKKKEKEKEEQEERETKTPEC 
PSHTQPIA3VYLLTPAVQDLWLRDKATFTCFWGSDLKDAHLTWEVAGKVPTGGVEEGL 
L ERHSNGS QS QHSRLTL PRS LWNAGTS VTCTI^THP S LP PQRLNtALRE P AAQ APVKLS L 
NLLAS SD P PEAAS WLLCEVSG FS PPNI LLMWLEDQREVNTSG F AP AR P PPQPRSTTFW 
AWS VLRVPAP PS PQPATYTCWSHEDSRTLLNASRSLEVS YLAMTPL I PQS KDENSDD 
YTTFDDVGS LWTTLSTFV AL F I LTLLYSG I VT F I KVK " 
CDS join{K02875. 1:101. -403. K0287S .1:101. .202, 

K02877. 1:101. . 172 , K02 878 . 1 : 101 . . 424 , 101 . .424, 

K02880.1:2S. .53) 

/partial 

/gene- "IGHD" 

/note- "secreted form" ■ 

/codon_start-3 

/product -"immunoglobulin delta -chain" 
/protein_id-"AAA52770. 1" 
./db_xref»"GI :495871" 
/db_xref-"GDB:G00-120-084" 

/translation-" PTKAPDVFP 1 1 SGCRHPKDNS PWLACLI TGYHPTSVTVTWYMG 
TQSQ PQRTFP E I QRRDS YYMTS SQLS TP LQQWRQGE YKCWQHT AS KSKKE I FRW PES 
P KAQ AS S VPTAQ PQAEGS LAKATTAP ATTRNTGRGGEE KKKE KE KE EQE ERETKTP E C 
P SHTQ P LGVYLLT P AVQDL WLRD KATFTC FWGS D L KDAHLTWEVAGKVPTGGVE EGL 
LERHSNGSQSQHSRLTLPRSLWNAGTSVTCTLNHPSLPPQRLMALREPAAQAPVKLSL 
NLLASSDPPEAASWLLCEVSGFS PPNI LLMWLEDQREVNTSGFAPARP PPQPRSTTFW 
AWSVLRVPAPPSPQPATYTCWSHEDSRTLLNASRSLEVSYVTDHGPMK" 
BASE COUNT 85 a 188 c 145 g 82 t 

ORIGIN About 150 bp after segment 4; 118 bp upstream of Hindlll site. 

1 ccacaggaaa ggagaaggga ggcaccacac cctggccggc cccacttctc tcccagtgcc 
61 cccgtggcca gagcctgaca gcccccccac ctccccgcag ctgcgcaggc acccgtcaag 
121 ctttctctga acctgctggc ctcgtctgac cctcccgagg cggcctcgtg gctcctgtgt 
181 gaggtgtctg gcttctcgcc ccccaacatc ctcctgatgt ggctggagga ccagcgtgag 
241 gtgaacactt ctgggtttgc ccccgcacgc ccccctccac agcccaggag caccacgttc 
301 • tgggcctgga gtgtgctgcg tgtcccagcc ccgcccagcc ctcagccagc cacctacacg 
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361 tgtgtggtca gccacgagga ctcccggacc ctgcecaacg ccagccggag cctagaagtc 
421 agctgtgagt cacccccagg ccagggttgg gacggggact ctgagggggg ccataaggag 
481 ctggaatcca tactaggcag 

[SEQUENCE ID NO: 33] 

K01311. Human IgD germline ... tgi : 184716] PubMed, Protein, Taxonomy, OMIM 

LOCUS HUMIGCB9 106 bp DNA PRI 12 -APR- 2001 

DEFINITION Human IgD germline chain J-delca region: C-delta CHI . 
ACCESSION K01 3 11 
VERSION K01311.1 GI;1847l6 

KEYWORDS C- region; germline; immunoglobulin heavy chain; 

immunoglobulin -delta . 
SOURCE human . 

ORGANISM Homo sapiens 

Eukaryota; Metazoa,- Chordata; Craniata; Vertebrata; Euteleostomi 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 106) 

AUTHORS Rabbitts,T.H. , Forster,A. and Milstein, C . P . 

TITLE Human immunoglobulin heavy chain genes: evolutionary comparisons of 

C mu. C delta and C gamma genes and ' associated switch sequences 
JOURNAL Nucleic Acids Res. 9 (18). 4509-4524 (1981) 
MEDLINE 82059479 
PUBMED 6795593 

COMMENT The deduced amino acid sequence is compared in [1] to the 

J/C-delta-1 junction of human ER1 protein. The delta gene occurs 
only S kb from the mu region. The authors [lj could not detect any 
switch-related sequences adjacent to the delta gene and state that 
this implies that the mu/delta switch cannot occur by the class 
switch recombination method. They speculate that the entire 
VH- (C-mu) - (C-delta) region is transcribed into one nuclear 
precursor molecule which is spliced later. 

This is part of a multigene region containing the J-region, switch 
region, C-mu-secreted, C-mu-membrane , and C-delta genes. 
FEATURES • Location/Qualifiers 
source 1 . . 106 

/organism= "Homo sapiens" 

/db_xref«"taxon: 9606" 

/map-"14q32 . 33 " 

/eel l_type«e " lymphocyte " 

/tis8ue_type» "placenta" 

/ tissue_type= "liver" 

/dev_stage» ■ foetus" 

/germline 

/tissue_lib«"of Lawn et al . " 
gene 1 . . 106 

/gene-"IGHD n 
intron <l . . 26 

/geneo»iGHD" 

/note- "intron delta J-C; GOO-120-084" 
CDS <27..>106 

/gene»"iGHD" 

/note«"C-region CHI domain" 
/codon_start»3 

/ produc t » " i mmunog 1 obul in delta-chain" 
/protein_id-"AABS9423 .1" 
/ db_xr e f - " GI -. 1 8 4 7 3 5 " 

/ 1 rans 1 a t ion» " PTKAPDVFP 1 1 SGCRHPKDNS PWLA " 
BASE COUNT 24 a 38 c 24 g 20 t 

ORIGIN 

1 tgccacccca ggactctgtc ttccagcacc caccaaggct ccggatgtgt tccccatcat 
61 atcagggtgc agacacccaa aggataacag ccctgtggtc ctggca 

[SEQUENCE ID NO: 58) 

K02880. Human germline IgD ... (gi : 184770} PubMed, Protein, Related Sequences, 
Taxonomy, OMIM, LinkOut 
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LOCUS 

DEFINITION 
ACCESSION 
VERSION 
KEYWORDS 

SOURCE 



REFERENCE 
AUTHORS 
TITLE 

• JOURNAL 
MEDLINE 

COMMENT 

FEATURES 

source 



PRI 08-NOV-1994 
C-region, secreted terminus. 



HUMIGCH06 100 bp DNA 

Human germline IgD chain gene, 
K02880 
K02880.1 GI:184770 

C-region; germline; immunoglobulin heavy chain; 
immunoglobulin -delta. 

Homo sapiens { individual_isolate Chronic lymphocytic leukemia (CLL) 
patient) DNA. 
Homo sapiens 

Eukaryota; Metazoa; Chordata,- Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 {bases 1 to 100) 

White, M.B., Shen,A.L., Word. C.J. , Tucker, P. W. and Blattner , F . R. 

Human immunoglobulin D: genomic sequence of the delta heavy chain 

Science 226 (4700), 733-737 (198S) 

85192522 

See segment l . 

Locat ion/Qual if iers 

1. .100 

/organism* "Homo sapiens" 

/isolate- "Chronic lymphocytic leukemia (CLL) patient" 
/ db_xr e f - n t axon : 9606" 
/map«"14q3 2 .33" 
/cell_cype= 11 lymphocyte " 
/germline 
<1 . .>100 
/gene="IGHD" 
/note- "GOO -120-084" 
/number- 5 
' <1. .24 
/gene="IGHD" 
/note- "GOO -120 -08 4" 
/number-5 
25. .>S3 
/gene-"IGHD" 

/note- "secreted terminus domain; G00-12O-O84" 
/number- 6 

1:1. .495.K02876.1 :1 . . 3 00 . K02877 . 1 : 1 . .300, 
500,K02S79 .l;l . .500,1. . 100 , K02881 . 1 : 1 - .200, 
52) 



join(K02875 . 
KQ2878.1:1. . 
K02682 .1:1. . 
Vgene-'IGHD' 
join(K02875 . 



1 : 101 . 



BASE COUNT 
ORIGIN 



. 4 03, K02B 76. 1:101. .202, 
K02877. 1:101. . 172, K02 878. 1:101. . 424 , K02B79 . 1 : 101 . .424, 
25. .53) 
/partial 
/gene»"IGHD" 
/note- -secreted form" 
/codon_start«3 

/product- "immunoglobulin delta -chain" 
/protein_id= n AAAS277 0.i« 
/db_xref-"GI:49S871- 
/db_xref»"GDB:GOO-120-084" 

/translation-" PTKAPDV FP 1 1 S GCRH P KDNS PWLACL I TGYHPTSVTVTWYMG 
TQSQPORTFPEIQRRDSYYMTSSQLSTPLQQMRQGEYKCWQHTASKSKKEIFRWPES 
P KAQ AS S VPTAQ PQ AEGS LAKATTAP ATTRNTGRGGEE KKKE KE KEEQEERETKTP E C 
PSHTQPIX^TfLLTPAVQDLWLRDKATFTCFVVGSDLKDAHLTWEVAGKVPTGGvEEGL 
LE RHSNGS QSQHSRLTL PRS LWN AGTSVTCTLNHP S L P PQRLMALRE PAAQAPVKLSL 
NLLASSDPPEAASWLLCEVSGFSPPNII^LMWLEDQREVNTSGFAPARPPPQPRSTTFW 
AWSVLRVPKPPSPQPATYTCWSHEDSRTLLNASRSLEVSYVTDHGPMK" 
a 33 c 22 g 21 t 

8 kb after segment 5 . 
1 gacacgccga ttttttgtta ttagatgtaa cagaccatgg ccccatgaaa tgatcccgga 
61 ccagatccgt ccgcacccgc cactcagcag ctctggccga 

[SEQUENCE ID NO: 3 6] 



24 

About 1. 
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K02881. Human germline IgD. .. tgi : 184771] PubMed, Procein, 
Taxonomy. OMIM, LinkOut 



Related Sequences, 



LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 
MEDLINE 

COMMENT 

FEATURES 

source 



PRI 08-NOV-1994 
C- region, first domain of membrane 



intron 



gene 



HUMIGCH0 7 200 bp DNA 

Human germline IgD-chain gene, 
terminus . 
K02881 

K02S81.1 GI:184771 

C- region; germline,- immunoglobulin heavy chain; 
immunoglobulin- delta . 

Homo sapiens (individual_isolate Chronic lymphocytic leukemia (CLL) 
patient) DNA. 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Crania ta; Vertebrata; Euteleostomi ,- 
Mammalia; Eutheria.- Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 200) 

white, M.B., Shen,A.L., Word, C. J., Tucker, P. w. and Blattner, F.R. 

Human immunoglobulin D: genomic sequence of the delta heavy chain 

Science 228 (4700) , 733-737 <198S) 

85192522 

See segment 1 . 

Location/Qualifiers 

1. .200 

/organism- "Homo sapiens" 

/isolate="Chronic lymphocytic leukemia (CLL) patient" 

/db_xref»"taxon: 9606" 

/map=-14q32 .33 " . 

/ eel l_type»" lymphocyte" 

/germline 

<1 . .24 

/gene-"IGHD " 
/note»"G00-120-0 84" 
/number»S 
25 . .182 
/gene»"IGHD" 

/note="first domain of membrane terminus; GOO-120-084; 

putative" 

/number- 6 

163 . . >200 

/gene»"IGHD" 

/note- "GOO -120 -084" 

/number- 6 

join (K02 875 .1:1. . 4 95 , K028 76 . 1 : 1 . 



.300,K02877 .1 :1. .300. 



BASE COUNT 



K02878.1:l. . 500 , K02B79 . 1 : 1 . . 500 , K02880 . 1 : 1 . .100,1. .200, 

K02882 .1 :1 . .52) 

/gene»"IGHD"- 

join (K02875. 1:101. . 403 , K02 8 76 . 1 : 101 . .202, 

K02877. 1:101. .172,K02878. 1:101. . 424 , K02 8 79 . 1 : 101 . .424, 

25. .182,K02882 .1 :44 . .52) 

/partial 
/gene«*IGHD" 

/note- "membrane bound form" 
/codon_start-3 

/product-" immunoglobulin delta-chain" 
/protein_id- "AAA52771 . 1 " 
/db_xref-"GI : 495872" 
/db_xref- n GDB:G0O-120-O84 " 

/ 1 rans 1 a t ion- " PTKAPDVFP 1 1 SGCRHP KDNSPWIACLITGyHPTSVTVTWYMG 
TQSQPQRTF PE I QRRDS YYMTS S QLS T P LQQWRQGE Y KCWQHTAS KS KKE I FRW PES 
PKAQASSVPTAQPQAEGSI^KATTAPATTRNTGRGGEEKKKEKEKEEQEERETKTPEC 
PSHTQPLGVYLLTPAVQDLWLRDKATFTCFVVGSDLKDAHLTWE\'AGKVPTGGVEEGL 
LERHSNGSQ S QHS RLTL PRSLWNAGT S VTCTLNHP S LP PQRLMALRE P AAQAPVKLS L 
NLLAS S D P P EAAS WLLCEVS GFSPPNI LLMWLEDQREVNTSG F AP ARP PPQPRSTTFW 
AWSVLRVPAPPS PQPATYTCWSHEDSRTLLNASRSLEVS YLAMTPLI PQSKDENSDD 
YTTFDDVGS LWTTLST FVALFI LTLLY SGI VTFI KVK ■ 
37 a 72 c 49 g 42 t 
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ORIGIN About 800 bp after segment 6. 

1 cgctcggccc ccgttcctcc ccagacctgg ccatgacccc cctgatccct cagagcaagg 
61 atgagaacag cgatgaccac acgacctttg atgatgtggg cagcctgtgg accaccctgt 
121 ccacgtttgt ggccctcttc atcctcaccc tcctctacag cggcattgtc actttcatca 
161 aggtcagggg agcggccagg 

I SEQUENCE ID NO: 3 8) 

K02882. Human germline IgD ... [gi : 184772] PubMed, Protein, Related Sequences, 
Taxonomy, OMIM, LinkOut 



LOCUS HUMIGCH0 8 100 bp DNA PRI 08 -NOV- 1994 

DEFINITION Human germline IgD-chain gene, C- region, second domain of membrane 

terminus . 
ACCESSION K02 882 
VERSION K02882.1 GI : 184772 

KEYWORDS C- region; germline; immunoglobulin heavy chain ; 

immunog 1 obul i n - de 1 1 a. . 
SOURCE Homo sapiens ( individual_isolate Chronic lymphocytic leukemia <CLL) 

patient) DNA. 
ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini Hominidae; Homo. 
REFERENCE 1 (bases 1 to 100) 

AUTHORS White,M.B., Shen.A.L., Word. C. J.. Tucker, P. W. and Blattner, F.R, 
TITLE Human immunoglobulin D: genomic sequence of the delta heavy chain 

JOURNAL Science 228 (4700), 733-737 (1985) 
MEDLINE 85192S22 
COMMENT See segment 1 . 

FEATURES Location/Qualifiers 
source 1 . . 100 

/organism- "Homo sapiens" 

/isolate«"Chronic lymphocytic leukemia (CLL) patient" 

/db_xref-" taxon: 9606" 

/map»"14q32 .33" 

/ c e 1 1 _ t ype » " 1 ymphocy t e " 

/germline 
intron <l . .43 

/gene- M IGHD" 

/note-"IgD-Mb" 

/number- 6 
exon . 44 . . >S2 

/gene-"IGHD" 

/note» -membrane -bound form (second domain of membrane 
terminus); GOO-120-084; putative" 
/number- 7 

gene join (K0287S . 1 : 1 . . 49S , K02B76 . 1 : 1 . . 300 , K02 877 . 1 : 1 . .300. 

K0287B.1:!. . 500 , K0287 9 . 1 : 1 . . S00 . K028 80 . 1 : 1 . .100, 

K02881 .1 :1. .200, 1 . .52) 

/gene«"IGHD" 

CDS join (K02875 .1:101. . 403 , K02876 . 1 : 101 . .202. 

K02877. 1:101 . . 172 , K02878 . 1 : 101 . . 4 24 , K028 79 . 1 : 101 . .424, 

K02881.1:25. .182, 44 . .52) 

/partial 

/gene-"IGHD" 

/note- "membrane bound form" 
/codon_start-3 

/product- "immunoglobulin delta-chain" 
/protein_id»"AAA5277i .1- • 
/db_xref-"GI: 4 95872" 
/db_xref «"GDB:G0O-120-0S4 ■ 

/ 1 rans 1 a t ion- ■ PTKAPDVFP 1 1 SGCRHPKDNS PWLACLI TGYHPTSVTVTWYMG 
TQS Q PQRT F PE I QRRDS YYMTSSQLST P LQQWRQGEYKCWQHTAS KS KKE I FRW P ES 
PKAQASSVPTAQPOAEGSLAKATTAPATTRNTGRGGEEKKKEKEKEEQEERETKTPEC 
PSHTQPLGVYLLTPAVQDLWLRDKATFTCFWGSDLKDAHLTWEVAGKVPTGGVEEGL 
LERHSNGSQSQHSRLTLPRSLWNAGTSVTCTLNHPSLPPQRLMALREPAAQAPVKLSL 
NLLASSDP PEAAS WLLCEVSGFS PPNILLMWLEDQREVNTSGFAPARFP PQPRSTTFW 
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AWSVLRVPAPPSPQPATYTCWSHEDSRTLLNASRSLEVSYLAMTPLIPQSKDENSDD 
YTTFDDVGS LWTTLSTFVALFI LTLLYSGI VTF I KVK" 
BASE COUNT 22 a 30c 30 g 18 t 

ORIGIN About 1.3 kb after segment 7. 

l tcaggcttct agcccctgtc tgaccccagg ggctgtcttt caggtgaagt agccccagaa 
61 gagcaggacg ccctgtacct gcagagaagg gaagcagcct 

[SEQUENCE ID NOs4 0] 

•K02875. Human germline IgD. .. (gi : 184765] PubMed, Related Sequences, Taxonomy. OMIM, 
LinkOut . 



LOCUS 

DEFINITION 
ACCESSION . 
VERSION 
KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 
MEDLINE 

COMMENT 



FEATURES 

source 



HUMIGCH01 495 bp DNA PRI 08-NOV-1994 

Human germline IgD chain gene, C-region, C-delta-1 domain. 
K0287S 

K02875.1 GI :184765 

C- region; germline; immunoglobulin heavy chain,- 
immunoglobulin-delta . 

Homo sapiens { individual_isolate Chronic lymphocytic leukemia {CLD 
patient) DNA - 
Homo sapiens 

Eukaryoca; Metazoa; Chordata,- Craniata; vertebrata.- Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 495) 

White, M.B., Shen,A.L., Word, C. J., Tucker, P. W. and Blatcner.F.R. 
Human immunoglobulin D: genomic sequence of the delta heavy chain 
Science 228 (4700), 733-737 (1985) 
85192S22 

Sequence in computer readable form and draft entry for [1] were 
kindly provided by M.B.White, 06-AUG-1985. 

The C-delta and delta -s exo'n boundaries were located by comparing 
the translated sequences with known AA sequences (1] . 

Location/Qualifiers 

1..49S 

/organism- "Homo sapiens" 

/isolate»"Chronic lymphocytic leukemia (CLL) patient" 

/db_xref-"taxon: 9606" 

/map»-i4q32 .33" 

/ eel l_type- " lymphocyte " 

/germline 

jointl. -495,K02876.1:1. . 300 , K02877 . 1 : 1 . .300, 

K02878.1:l. . 500 , K02 879 . 1 : 1 . . 500 , K02 880 . 1 : 1 . .100, 

K02 881.1:1. .200,K02 882.1;1. .52) 

/gene-"IGHD" - 

<1. .100 

/gene»"IGHD" 

/note="J-C intron; G00-120-084" 

join (101. . 403, K02876. 1:101. . 202 , K02877 . 1 : 101 . .172, 

K02B78. 1:101. . 424 , K02879 . 1 : 101 . . 4 24 , K0288 0 . 1 : 25 . .53) 

/partial 

/gene* "IGHD" 

/note-"secreted form" 

/codon_start-3 

/product- "immunoglobulin delta- chain" 
/protein_id»"AAAS2770. 1" 
/db_xref «"GI : 495871" 
/db_xref- "GDB: GO 0-120 -084" ' 

/ 1 ran slation-" PTKAP DVFPII SGCRH P KDNS PWLACL I TG YHPTS VTVTWYMG 
TQSQPQRTFPEIQRRDSYYMTSSQLSTPLQQWRQGEYKCVVQHTASKSKKEIFRWPES 
P KAQ AS S V PTAQPQAEGS LAKATTAP ATTRNTGRGGEEKKKE KEKEEQE ERETKT P EC 
P S HTQ PLG VYLLT PAVQDLWLRDKAT FTCFWGSDLKDAHLTWEVAGKVPTGGVEEGL 
LERHS NGSQSQHSRLTLP RS LWNAGTSVTCTLNHPS LP PQRLMALRE PAAQAPVKLS L 
NLLASSDPPEAASWLLCEVSGFSPPNILLMWLEDQREVNTSGFAPARPPPQPRSTTFW 
AWSVLRVPAP PS PQPATYTCWSHEDSRTLLNASRSLEVS YVTDHGPMK " 
101 . .403 
/gene«"IGHD" 

/note="C-delta-l domain; GOO-120-084; putative" 
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/number- 1 

CDS join (101. . 403 ,K02876. 1:101. . 2 02 , K02 8 77 . 1 : 101 . -172, 

K0287B. Is 101. .4 24,K02 87 9.1:101. . 424 , K02881 . 1 : 25 . .182. 

K02882.1:44. .52) 

/partial 

/gene= M IGHD" 

/note- "membrane bound form" 
/ codon^s t a r t = 3 

/produce-" immunoglobulin delta -chain" 
/pro tein_id-"AAA5 2771 .1" 
/ db_xref » "GI : 4 95872 •* 
/db_xref - "GDB : GO 0-120-0 84" 

/ translation- " PTKAPDVFPIISGCRHPKDNSPWLACLITGYHPTSVTVTWYMG 
TQS Q PQRT FPE I QRRDS YYMTS SQLST P LQQ WRQGEYKCWQHTAS KS KKE I FRW PES 
PKAQASSVPTAQPQAEGSLAKATTAPATTRNTGRGGEEKKKEKEKEEQEERETKTPEC 
PSHTQPLGVYLLTPAVQDLWLRDKATFTCFWGSDLKDAHLTWEVAGKVPTGGVEEGL 
LERHSNGSQSQHSRLTLPRSLWNAGTS\rrCTLNHPSLPPQRLMALREPAAQAPVKLSL 
NLIASSDPPEAASWLLCEVSGFSPPNILLMWLEDQREVNTSGFAPARPPPQPRSTTFW 
AWSVIJIVPAPPSPQPATYTCVVSHEDSRTLLNASRSLEVSYLAMTPLIPQSKDENSDD 
YTTFDDVGSLWTTLSTFVALF-ILTLLYSGI VTFI KVK" 
intron 404 . . >495 

/gene- " IGHD" 
/note- "GOO- 120-0 84 " 
/number- 1 

BASE COUNT 114 a 179. c 120 g 82 t 

ORIGIN 182 bp upstream of SphI site; chromosome 14q3 2.3. 

1 tttccctgcc tcccgtcacc ctgccgccag ggcctctgcc ctgccctgcc ccttgtcctc 
61 aggtttccag cctcagactc ccactgtgtc tgtcttccag cacccaccaa ggctceggat 
121 gtgttcccca tcatatcagg gtgeagacac ccaaaggata acagccctgt ggtcctggca 
181 tgcttgataa ctgggtacca cccaacgtcc gtgactgtca cctggtacat ggggacacag 
241 agccagcccc agagaacctt ccctgagaca caaagaeggg acagctacta catgacaagc 
3 01 agccagctct ccacccccct ccagcagtgg cgccaaggcg agtacaaatg cgtggtccag 
361 cacaccgcca gcaagagtaa gaaggagatc ttccgctggc caggtaggtc geaceggaga 
421 tcacccagaa gggcccccca ggacccccag caccctccac teagggectg accacaaaga 
481 cagaagcaag ggctg 

[SEQUENCE ID NO: 42] 



VIII. HUMAN IG EPSIL,ON CHAIN C REGION - HOMO SAPIENS (HUMAN). 

AMINO ACID SEQUENCE 

>splP01B54 }EPC_HUMAN IG EPSILON CHAIN C REGION - Homo sapiens (Human). 



10 


20 


30 


40 


50 


60 


I 

ASTQSPSVFP 


1 

LTRCCKNIPS 


1 

NATSVTLGCL 


1 

ATGYFPEPVM 


1 

VTWDTGSLNG 


I 

TTMTLPATTL 


70 


80 


90 


100 


110 


120 


1 

TLSGHYATIS 


1 

LLTVSGAWAK 


1 

QMFTCRVAHT 


1 

PSSTDWVDNK 


1 

TFSVCSRDFT 


1 

PPTVKILQSS 


130 


140 


150 


160 


170 


180 


I 

CDGGGHFPPT 


1 

IQLLCLVSGY 


1 

TPGTINITWL 


1 

EDGQVMDVDL 


1 

STASTTQEGE 


1 

LASTQSELTL 


190 


200 


210 


220 


230 


240 


1 

SQKHWLSDRT 


1 

YTCQVTYQGH 


1 

TFEDSTKKCA 


1 

DSNPRGVSAY 


1 

LSRPSPFDLF' 


1 

IRKSPTITCL 


250 


260 
1 


270 


280 


290 


300 


1 

WDLAPSKGT 


VNLTWSRASG 


1 

KPVNHSTRKE 


1 

E KQ RNGTLTV 


1 

TSTLPVGTRD 


1 

WIEGETYQCR 
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310 320 330 340 350 360 

- I I I I I I 

VTHPHLPRAL MRSTTKTSGP RAAPEVYAFA TPEWPGSRDK RTLACLIQNF MPEDISVQWL 

370 380 390 400 410 420 

I I ! I I I 

HNEVQLPDAR HSTTQPRKTK GSGFFVFSRL EVTRAEWEQK DEFICRAVHE AASPSQTVQR 

428 
I 

AVSVNPGK 



(SEQUENCE ID NO s 49 J 



CODING SEQUENCE 

atggactgga cctggatccc cttcttggtg gcagcagcca cgcgagtcca 

cagtcggcgc agectggggc tgaggtgagg aagcccgggg caccagtgag 

aaggcttctg gatacaccct catcgactcc tatatccact ggatacgaca 

cacgggcttg agtgggtggg atggaccaac cctaacagcg gtggcacaaa 

agatttcagg gcagggtcac catgaccaga gacgcgtcct tcagtacagc 

ccgagaagtc tgagacccga cgactcggcc gtgttttact gtgcgaaaag 

tggagtgatt actataactt tgactactcg tacactttgg acgtccggg'g 

acggccaccg tctcctcagc ctccacacag agcccacccg tctccccctc 

tgcaaaaaca ttccctccaa tgccacctcc gtgactctgg gctgcctggc 

ttcccggagc cggtgatggt gacccgggac acaggccccc tcaacgggac 

ttaccagcca ccaccctcac gctctccggt cactatgcca ccatcagctt 

tcgggtgcgt gggccaagca gatgttcacc tgccgcgtgg cacacacccc 

gactgggtcg acaacaaaac ctccagcgtc tgctccaggg acttcacccc 

aagatcttac agtcgtcctg cgacggcggc gggcacttcc ccccgaccat 

tgcctcgtct ctgggtacac cccagggact atcaacatca cctggctgga 

gtcatggacg tggacttgtc caccgcctct accacgcagg agggtgagct 

caaagcgagc tcaccctcag ccagaagcac tggctgtcag accgcaccca 

gccacccatc aaggtcacac ctctgaggac agcaccaaga agtgtgcaga 

agaggggcga gcgcctacct aagccggccc agcccgttcg acctgttcat 

cccacgatca cctgtctggt ggtggacccg gcacccagca aggggaccgt 

cggccccggg ccagcgggaa gcccgcgaac cactccacca gaaaggagga 

aatggcacgt caaccgccac gtccaccctg ccggtgggca cccgagaccg 

gagacccacc agtgcagggt gacccacccc cacctgccca gggccctcat 

accaagacca gcggcccgcg tgctgccccg gaagtccatg cgtttgcgac 

ccggggagcc gggacaagcg caccctcgcc tgcctgatcc agaacttcat 

atctcggtgc agcggctgca caacgaggtg cagctcccgg acgcccggca 

cagccccgca agaccaaggg ctccggcttc t tcgtcttca gccgcctgga 

gccgaacggg agcagaaaga tgagttcatc tgccgtgcag tccatgaggc 

tcacagaccg tccagcgagc ggcgcccgca aaccccggta aatga 



ctcccagacg 


60 


ggtctcctgc 


120 


ggcccccggg 


180 


ctatgctccg 


240 


ccacacggac 


300 


tgaccctttt 


360 


ccaagggacc 


420 


gacccgctgc 


480 


cacgggctac 


540 


aactatgacc 


600 


gctgaccgtc 


660 


accgtccaca 


720 


gcccaccgtg 


7B0 


ccagctcctg 


840 


ggacgggcag 


900 


ggcccccaca 


960 


cacccgccag 


1020 


ttccaacccg 


1080 


ccgcaagtcg 


1140 


gaacctgacc 


1200 


gaagcagcgc 


1260 


gatcgagggg 


1320 


gcggtccacg 


1380 


gccggagtgg 


1440 


gcctgaggac 


1500 


cagcacgacg 


1560 


ggcgaccagg 


1620 


agcgagcccc 


1680 




1725 


[SEQUENCE ID 


NO:44] 



Gen Bank 

L00022. Human Ig active he. 



LOCUS 
DEFINITION 
ACCESSION 
VERSION 
KEYWORDS 

SOURCE 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 



HUMIGHAE2 1920 bp DNA PRI 22-DEC-1994 

Human Ig active heavy chain epsilon-l gene, constant region. 
L00022 J00227 V00555 
L00022.1 GI:18503S 

C-region; epsi Ion- immunoglobulin; immunoglobulin heavy chain; 
processed gene. 

Human myeloma cell line 266B1 DNA and cDNA to mRNA, clones 
H-Ig-epsilon-11 , lambda - eps ilon-1 . 2 , pJJ71, pGET2 and K85/A12 (see 
comment) . 
Homo sapiens 

Eukaryota; Metazoa-; Chordata; Craniata; Vertebrata; Euteleostomi 

Mammalia; Eucheria; Primates; Catarrhini; Hominidae; Homo. 

1 (bases 1 to 1920) 

Flanagan, J. G. and Rabbit ts , T . H . 

The sequence of a human immunoglobulin epsilon heavy chain constant 
region gene, and evidence for three non-allelic genes 
EMBO J. 1 (5), 655-660 (1982) 
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MEDLINE 8423 6029 
REFERENCE 2 (bases 528 to "736; 1044 to 1138) 

AUTHORS Nishida,Y., Miki , T . , Hisajima.H. and Honjo,T. 

TITLE Cloning of human immunoglobulin epsilon chain genes: evidence for ' 

multiple C epsilon genes 
JOURNAL Proc. Natl. Acad. Sci . U.S.A. 79 (12), 3833-3837 (1982) 
MEDLINE 8224 7945 
REFERENCE 3 (bases 1 to 1920) 

AUTHORS Kent en , J. H. , Molgaard, H. V. , Houghton.M., Derbyshire , R . B . , Viney.J., 

Bell.L.O. and Gould. H.J. 
TITLE Cloning and sequence determination of the gene for the human 

immunoglobulin epsilon chain expressed in a myeloma cell line 
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 79 (21), 6661-6665 (1982) 
MEDLINE 83 06 523 4 
REFERENCE 4 (bases 98 to 1884) 

AUTHORS Seno,M., Kurcka wa , T . , Ono, Y . Onda, H . , Sasada,R., Igarashi,K., 

Kikuchi,M., Sugino.Y., Nishida.Y. and Honjo.T. 
TITLE Molecular cloning and nucleotide sequencing of human immunoglobulin 

epsilon chain cDNA 
JOURNAL Nucleic Acids Res. 11 (3), 719-726 (1983) 
MEDLINE 83168897 
REFERENCE 5 (bases 691 to 807; 1571 to 1818; 1860 to 1885) 
AUTHORS Liu.F.T., Albrandt , K. A. , Bry.C-G. and Ishizaka.T. 

TITLE Expression of a biologically active fragment of human IgE epsilon 

chain in Escherichia coli 
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 81 (17), 5369-5373 (1984) 
MEDLINE 84298140 

COMMENT [21 and [1] report the isolation of two other epsilon genes, 

epsilon-2 and epsilon-3 . The authors in [2] claim that epsilon-3 is 
a pseudogene. Compared in [4] with the germline C-region sequence 
by Max, et al (Cell 29, 691-699 (1982)), and there are three 
nucleotide differences. The deduced amino acid sequence in [4] 
differs somewhat from the published C-region sequence. [5 3 produced 
expression of IgE in E.coli by insertion into expression vector 
pUC7 . 

Complete source information: 

Human myeloma cell line 266B1 DNA [2] . [13 , (5} and cDNA to mRNA [3] , 
[4], clones H-Ig-epsilon-11 [2], lambda-epeiion-1 . 2 [1], pJJ71 [3], 
pG£T2 [4] and K85/A12 [5] . 

Location/Qualifiers 
1 . .1920 

/organism- "Homo sapiens" 
/ db_xr e f - " t axon : 9 6 0 6 ■ 
/map-"14q32.33" 
prim^transcript <1..1886 

/note="epsilon-l mRNA" 
<1..97 

/gene»"IGHE" 

/note- "epsilon-1 intron J-C" 
98 . .406 
/gene-"IGHE" 

/note»"Ig heavy chain epsilon-1 (CHI domain); G00-119-335" 
407. .613 
/gene-"IGH£" 

/note«"epsiloh-l intron A" 
614. .934 
/gene-"IGHE" 

/note="Ig heavy chain epsilon-1 (CH2 domain)" 
conflict 73 5 

/gene»"IGHE" 
/citation- (33 
/replaces*" 
intron 935 . . 1020 

/geneo-IGHE" 

/not e= "epsilon-1 intron B " 
exon 1021 . . 1344 



FEATURES 

source 



intron 



exon 



intron 
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/gene«"IGHE" 

/note-'lg heavy chain epsilon-1 (CH3 domain)" 
conflict 1124 

/gene-"IGHE" 

/citation- (3] 

/replace-" " 
conflict 1337 

/gene- M IGHE" 

/citation- [3 ] 

/replace- " " 
intron 1345 . .1427 

/gene-"IGHE" 

/note-" eps i Ion- 1 intron C" 
exon 1428..*1759 

/note= M Ig heavy chain epsilon-1 (CH4 domain)" 
conflict 1444.. 1445 

/gene« u IGHE " 

/citation- (3j 

/replace-" " 
.conflict 1612 

/gene«"IGHE" 

/citation- {3] 

/replace-" " 
conflict 1785 

/citation- [3] 

/replace-" " 

gene join {L00021 . 1 : 57 . .495,1. .1758) 

/gene-"IGHE" 

CDS join (L00021. 1:57. .495,98. .406,614. .934, 1021. .1344, 

1428. .1759) 
/partial 
/gene-"IGHE" 

/note-"Ig heavy chain epailon-1 (V-D-J region)" 

/ codon_s t art - 1 

/protein_id» "AAB59424 . l - 

/db_xref « n GI : 386807" 

/ db_xr e f = " GDB : GO 0-119-335" 

/ 1 r ans 1 a t i on- " MDWTW I LFLVAAATRVHSQTQLVQSGAEVRKPGAS VRVSCKASG 
YTFIDSYI HW I RQAPGHGLEWVGWINPNSGGTNYAPRFQGRVTMTRDAS FSTAYMDLR 
S LRS DOS AVFY CAKSD P FWS DYYN FD YS YTLDVWGQGTTVTVSS AS TQ S PS V F PLTRC 
CKN I P SNATS VTLGCIATG YF P EP VMVTWDTGS LNGTTMTLP ATTLTLSGHYAT ISLL 
TVS GAWAKQM FTCRVAHTP S S TDWVDNKT FSVCS RD FT P PTVKI LQS S CDGGGH FP PT 
I QLLCLVS GYT PGT INI TWL EDGQVMDVDLSTASTTQEGE LASTOS ELTLSQ KHWLSD 
RTYTCQVTYQGHTFEDSTKKCADSNPRGVSAYLSRPSPFDLFIRKSPTITCLWDLAP 
S KGTVNLTWS RASGKPVNHS TRKEE KQRNGTLTVTSTL PVGTRDW I EG ETYQCRVTHP 
HLPRAIiMRSTTKTSGPRAAPEVYAFATPEWPGSRDKRTLACLIQNFMPEDISVQWLHN 
EVQLPDARHSTTQPRKTKGSGFFVFSP^EVTRAEWEQKDEFICRAVHEAASPSQTVQR 
AVSVNPGK" 

[SEQUENCE ID NO: 60] 



BASE COUNT 387 a 658 

ORIGIN About 3 kb after 

1 ggatccctgc cacggggtcc 
61 gcctgaggct ggcactgact 
121 ttccccttga cccgctgctg 
181 tgcctggcca cgggctactt 
241 aacgggacaa ctatgacctt 
301 atcagcttgc tgaccgtctc 
361 cacactccat cgtccacaga 
421 agctcagaga ccacagttcc 
481 tgagggggtg ggtgggctca 
541 acgtgggggc aagaggaggg 
601 agctctcccc cagtctgctc 
661 tcctgcgacg gcggcgggca 
721 tacaccccag ggactatcaa 
781 ttgtccaccg cctctaccac 



576 g 299 t 
1 bp upstream of BamHI site, 

ccagctcccc catccaggcc ccccaggctg atgggcgctg 

aggttctgtc ctcacagcct ccacacagag cccatccgtc 

caaaaacatt ccctccaatg ccacctccgt gactctgggc 

cccggagccg gtgacggtga cctgggacac aggctccctc 

accagccacc accctcacgc tctctggtca ccatgccacc 

gggtgcgtgg gccaagcaga tgttcacctg ccgtgtggca 

ctgggtcgac aacaaaacct tcagcggtaa gagagggcca 

caggagtgcc aggctgaggg ctggcagagt gggcaggggt 

aacgtgggaa cacccagcat gcctggggac ccgggccagg 

cacacagagc tcagagaggc caacaaccct catgaccacc 

cagggacttc accccgccca ccgtgaagac cttacagtcg 

cttccccccg accatccagc tcctgcgcct cgtctctggg 

catcacctgg ctggaggacg ggcaggtcat ggacgtggac 

gcaggagggt gagctggcct ccacacaaag cgagctcacc 



WO 03/064992 



42/91 



PCT/US02/34197 



FIGURE 711 

841 ctcagccaga agcactggct gtcagaccgc acctacacct gccaggccac ctatcaaggt 

901 cacacctctg aggacagcac caagaagtgt gcaggcacgc ccccacccgc ccrggtggcc 

961 gccacggagg ccagagaaga ggggcgggtg ggcctcacac agccccccgg tgtaccacag 

1021 atcccaaccc gagaggggcg agcgcctacc taagccggcc cagcccgctc gacctgctca 

1081 cccgcaagtc gcccacgacc acctgtctgg cggcggaccc ggcacccagc aaggggaccg 

1141 tgaacctgac ccggtcccgg gccagtggga agcctgtgaa ccactccacc agaaaggagg 

1201 agaagcagcg caatggcacg ttaaccgtca cgtccaccct gccggcgggc acccgagact 

1261 ggaccgaggg ggagacctac cagtgcaggg cgacccaccc ccacctgccc agggccctca 

1321 tgcggtccac gaccaagacc agcggtgagc catgggcagg ccggggtcgt gggggaaggg 

13 81 agggagcgag tgagcggggc ccgggctgac cccacgtctg gccacaggcc cgcgtgctgc 

1441 cccggaagtc tatgcgtttg cgacgccgga gtggccgggg agccgggaca agcgcaccct 

1501 cgcctgcccg atccagaact tcatgcctga ggacacctcg gtgcagtggc tgcacaacga 

1S61 ggtgcagctc ccggacgccc ggcacagcac gacgcagccc cgcaagacca agggctccgg 

1621 cttcttcgtc ttcagccgcc tggaggtgac cagggccgaa tgggagcaga aagatgagtc 

1681 catctgccgt gcagtccatg aggcagcgag cccctcacag accgtccagc gagcggtgcc 

1741 tgtaaatccc ggtaaatgac gtacccccgc ccccctccct cccagggccc cacccagctg 

1801 tgcagtgggg aggactggcc agaccctccg tccactgctg caatgacccc aggaagctac 

1861 ccccaataaa cngtgcctgc tcagagcccc agtacaccca ttcttgggag cgggcagggc 

[SEQUENCE ID NO: 5 9] 

IX. HUMAN IG MU CHAIN C REGION - HOMO SAPIENS (HUMAN). 

AMINO ACID SEQUENCE 

>sp| P01871 |MUC_HUMAN IG MU CHAIN C REGION - Homo sapiens (Human). 



10 


20 


30 


40 


50 


60 


1 

GSASAPTLFP 


1 

LVSCENSPSD 


1 

TSSVAVGCLA 


1 

QDFLPDSITF 


1 

SWKYKNNSDI 


1 

SSTRGFPSVL 


70 
1 


80 

i 


90 


100 


110 


120 


RGGKYAATSQ 


VLLPSKDVMQ 


I 

GTDEHWCKV 


1 

QHPNGNKEKN 


1 

VPLPVIAELP 


1 

PKVSVFVPPR 


130 


140 


150 


160 


170 


180 


1 

DGFFGNPRSK 


1 

SKLICQATGF 


1 

SPRQIQVSWL 


1 

REGKQVGS GV 


1 

TTDQVQAEAK 


i 

ESGPTTYKVT 


190 
1 


200 


210 


220 


230 


240 


STLTIKESDW 


1 

LSQSMFTCRV 


1 

DHRGLTFQQN 


1 

ASSMCVPDQD 


1 

TAIRVFAIPP 


1 

SFASIFLTKS 


250 


260 


270 


280 


290 


300 


1 

TKLTCLVTDL 


1 

TTYDSVTISW 


1 

TRQNGEAVKT 


1 

HTNISESHPN 


1 

ATFSAVGEAS 


1 

ICEDDWNSGE 


310 


320 


330 


340 


350 


360 


1 

RFTCTVTHTD 


1 

LPSPLKQTIS 


1 

RP KG VALHRP 


1 

DVYLLPPARE 


1 

QLNLRESATI 


1 

TCLVTGFS PA 


370 


380 


390 


400 


410 


420 


1 

DVFVQWMQRG 


1 

QPLSPEKYVT 


1 

SAPMPEPQAP 


1 

GRYFAHSILT 


1 

VSEEEWNTGE 


1 

TYTCWAHEA 


430 


440 


450 








1 

LPNRVTERTV 


1 

DKSTGKPTLY 


1 

NVS LVKSDTA 


GTCY 







[SEQUENCE ID NOt47] 



CODING SEQUENCE 

atggaccgga cctggaggtt cctccttgcg gtggcagcag ccacaggtgt ccagccccag 60 

gcgcagccgg cgcagtctgg ggccgaggcg aagaagcctg ggtccccggc gaaggtcccc 120 

tgcaaggcct ctggaggcac cttcagcagc catgccacca gctgggtgcg acaggcccct 180 

ggacaagggc ttgagcggat gggagggacc atccctatct tcggtacagc aaaccacgca 240 

cagaagctcc agggcagagt cacgaccacc gcggacgaat ccacgagcac agcccacacg 300 
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gagctgagca gcctgagatc tgaggacacg gccgtgtatt actgtgcgaa aaccgggatc 360 

ccggggccgt atagcagtgg ctggtacccg aactcggact actactacta cggtatggac 420 

gtctggggcc aagggaccac ggtcaccgtc tcctcaggga gtgcatccgc cccaaccctt 480 

tccccccccg tcccccgtga gaattccccg ccggacacga gcagcgtggc cgccggccgc 540 

cccgcacagg actcccctcc cgaccccatc actttctcct ggaaatacaa gaacaactct 600 

gacatcagca gcacccgggg ctccccatca gtcctgagag ggggcaagca cgcagccacc 660 

tcacaggtgc tgctgccttc caaggacgcc atgcagggca. cagacgaaca cgtggcgcgc 720 

aaagtccagc accccaacgg caacaaagaa aagaacgtgc ctcctccagt gattgctgag 780 

ctgcctccca aagtgagcgt ccccgtccca ccccgcgacg gcttcttcgg caacccccgc 840 

agcaagtcca agctcatccg ccaggccacg ggtttcagtc cccggcagac tcaggtgtcc 900 

tggctgcgcg aggggaagca ggtggggtct ggcgccacca cggaccaggt gcaggctgag 960 

gccaaagagt ctgggcccac gacctacaag gtgaccagca cactgaccat caaagagagc ' 1020 

gactggctca gccagagcat gttcacctgc cgcgtggatc acaggggcct gaccttccag 1080 

cagaatgcgt cccccatgcg tgtccccgat caagacacag ccatccgggc ccccgccatc 1140 

cccccatcct ttgccagcat cttcctcacc aagtccacca agttgacccg cccggccaca 1200 

gacctgacca cctatgacag cgtgaccatc Ccctggaccc gccagaacgg cgaagctgtg 1260 

aaaacccaca ccaacatccc cgagagccac cccaatgcca ctttcagcgc cgtgggtgag 13 20 

gccagcatct gcgaggatga ctggaattcc ggggagaggt ccacgtgcac cgtgacccac 13 80 

acagacctgc cctcgccact gaagcagacc atctcccggc ccaagggggc ggccctgcac 1440 

aggcccgatg tctacctgct gccaccagcc cgggagcagc tgaacccgcg ggagtcggcc 1500 

accaccacgc gcccggcgac gggcttctct cccgcggacg tcttcgcgca gtggacgcag 1560 

a 9ggggcagc ccctgccccc ggagaagtat gtgaccagcg ccccaatgcc tgagccccag 1620 

gccccaggcc ggtacttcgc ccacagcatc ctgaccgtgt ccgaagagga atggaacacg 1680 

ggggagacct acaccCgcgt ggeggcccat gaggccccgc ccaacagggc caccgagagg 1740 

accgtggaca agcccaccga gggggaggcg agcgccgacg aggagggctt tgagaacctg 1800 

tgggccaccg cctccacccc catcgtcccc ctcctcctga gcctccccca cagtaccacc i860 

gtcaccctgc ccaaggcgaa atga 1884 



[SEQUENCE ID NO: 4 6] 



GenBank 

X17115. Human mRNA.for IgM 

LOCUS HSIGM201 2213 bp mRNA PRI 03-APR-1995 

DEFINITION Human mRNA for IgM heavy chain complete sequence. 
X1711S 

X17115.1 GI:33450 

Ig heavy chain; IgM gene; IgM heavy chain; transmembrane protein, 
human. 

Homo sapiens 

Eukaryota; Metazoa ,- chordata; Craniata; Vertebrata; Euteleostomi 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 (bases 1 to 2213) 
Friedlander , R . M . 
Direct Submission 

Submitted (03-NOV- 1989) Friedlander R. M. , Harvard Medical School, 
Howard Hughes Medical Institute, Department of Genetics, 2 5 
ShatCuck St, Boston, MA 02115, USA 

2 (bases 1 to 2213) 

Friedlander, R.M . , Nussenzweig, M . C . and Leder,P. 

Complete nucleotide sequence of the membrane form of the human IgM 
heavy chain 

Nucleic Acids Res. IB (14), 4278 (1990) 
903324S0 

3 (bases 1 to 2213) 

Kristensen, T. , Lopez, R. and Prydz,H. 

An estimate of the sequencing error frequency in the DNA sequence 
databases 

DNA Seq. 2 (6), 343-346 (1992) 
93075997 

Erratum: [ [published erratum appears in DNA Seq 1993 ; 3 ( 5) :337) ] 
For genomic sequence see <K01306> , <X14939> and 
<X14940>. The author reports various conflicts with these 
sequences. Data kindly reviewed (30-MAY-1990) by Friedlander R.M. 
Locacion/Qualif iers 
1. .2213 

/organism* "Homo sapiens" 
/db xrefo-taxon: 9606" 



ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
REMARK 
COMMENT 



FEATURES 

source 
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FIGURE 7KK 



/clone="201-203" 

/cell_line»" lymphoma 201" 

/cell_type«"B" 

/ 1 is sue_type- " lymphoid" 
misc_feature 1..3 9 

/note= "putative VECTOR sequence Bluescript SKP+" 

/citation- (3 ] 
CDS 73.. 1956 

/note** "precursor (AA -IS to 612)" 

/codon_start=l 

/prot ein_id» u CAA3 4 9 7 1 . l " 

/db_xref »"GI :334S1" 

/ db^xref a "SWISS - PROT : P01871 " 

/db_xref s"SWISS-PROT: P2 076 9" 

/ translation* "MDWTWRFLFVVAAATGVQSQVQLVQSGAEVKKPGSSVKVSCKAS 
GGTFSS YAI SWVRQAPGQGLEWMGG 1 1 P I FGTANYAQKFQGR VT I TADESTSTAYMEL 
SSLRSEDTAVYYCAKTGILGPYSSGWYPNSDYYYYGMDV>IGQGTTVTVSSGSASAPTL 
FPLVSCENSPSDTSSVAVGCLAQDFLPDS1TFSWKYKNNSDISSTRGFPSVLRGGKYA 
ATSQVLLPSKDVMQGTDEHVVCKVQHPNGNKEKNVPLPVIAELPPKVSVFVPPRDGFF 
GN PRS KS KL I CQATG FS P RQ I Q VS WLREGKQVGS G VTTDQVQAEAKESGPTTYKVTS T 
LTI KESDWLSQSMFTCRVDHRGLTFQQNASSMCVPDQDTAIRVFAI PPSFAS I FLTKS 
TKLTCLVTDLTTYDSVT I SWTRQNGEAVKTHTNI S ESHPNATFSAVGEAS I CEDDWNS 
GERFTCTVTHTDLPSPLKQT I SRP KGVALHRPDVYLLP P AREQLNLRESAT I TCLVTG 
FS PADVFVQWMQRGQPLS PEKYVTSAPMPEPQAPGRYFAHS ILTVSEEEWNTGETYTC 
WAHEALPNRVTERTVDKSTEGEVSADEEGFENLWATASTFIVLFLLSLFYSTTVTLF 
KVK" 

[SEQUENCE ID NO: 62] 

sig_peptide 73.. 117 

mat_peptide 118.. 1953 

/product-" IgM heavy chain (AA 1 to 612) " 
polyA_site 2213 

/note«"polyA site" 
BASE COUNT 462 a 708 c 629 g 414 t 

ORIGIN 

1 gctctagaac tagtggatcc cccgggctgc aggaattctc taaagaagcc cctgggagca 
61 cagctcatca ccatggactg gacctggagg ttcctctttg tggtggcagc agctacaggt 
121 gtccagtccc aggtgcagct ggtgcagtct ggggctgagg tgaagaagcc tgggtcctcg 
181 gtgaaggtct cctgcaaggc ttctggaggc accttcagca gctatgctat cagctgggtg 
24 1 cgacaggccc ctggacaagg gcttgagtgg atgggaggga tcatccctat ctttggtaca 
3 01 gcaaactacg cacagaagtt ccagggcaga gtcacgatta ccgcggacga atccacgagc 
3 61 acagcctaca tggagctgag cagcctgaga tctgaggaca cggccgtgta ttactgtgcg 
421 aaaaccggga tcctggggcc gtatagcagt ggctggtacc cgaactcgga ctactactac 
481 tacggtatgg acgtctgggg ccaagggacc acggtcaccg tctcctcagg gagtgcatcc 
541 gccccaaccc ttttccccct cgtctcctgt gagaattccc cgtcggatac gagcagcgtg 
601 gccgttggct gcctcgcaca ggacttcctt cccgactcca tcactttctc ctggaaatac 
661 aagaacaact ctgacatcag cagcacccgg ggcttcccat cagtcctgag agggggcaag 
721 tacgcagcca cctcacaggt gctgctgcct tccaaggacg tcatgcaggg cacagacgaa 
781 cacgtggtgt gcaaagtcca gcaccccaac ggcaacaaag aaaagaacgt gcctcttcca 
84 1 gtgattgctg agctgcctcc caaagtgagc gtcttcgtcc caccccgcga cggcttcttc 
901 ggcaaccccc gcagcaagtc caagctcatc tgccaggcca cgggtttcag tccccggcag 
961 attcaggtgt cctggctgcg cgaggggaag caggtggggt ctggcgtcac cacggaccag 
1021 gtgcaggctg aggccaaaga gtctgggccc acgacctaca aggtgaccag cacactgacc 
1081 atcaaagaga gcgactggct cagccagagc atgttcacct gccgcgtgga tcacaggggc 
1141 ctgaccttcc agcagaatgc gtcctccatg tgtgtccccg atcaagacac agccatccgg 
1201 gtcttcgcca tccccccatc ctttgccagc atcttcctca ccaagtccac caagttgacc 
1261 tgcctggtca cagacctgac cacctatgac agcgtgacca tctcctggac ccgccagaat 
1321 ggcgaagctg tgaaaaccca caccaacatc tccgagagcc accccaatgc cactttcagc 
13 81 gccgtgggtg aggccagcat ctgcgaggat gactggaatt ccggggagag gttcacgtgc 
1441 accgtgaccc acacagacct gccctcgcca ctgaagcaga ccatctcccg gcccaagggg 
1501 gtggccctgc acaggcccga tgtctacttg ctgccaccag cccgggagca gctgaacctg 
1561 cgggagtcgg ccaccatcac gtgcctggtg acgggcttct ctcccgcgga cgtcttcgtg 
1621 cagtggatgc agagggggca gcccttgtcc ccggagaagt atgtgaccag cgccccaatg 

16 81 cctgagcccc aggccccagg ccggtacttc gcccacagca tcctgaccgt gtccgaagag 

17 41 gaatggaaca cgggggagac ctacacctgc gtggtggccc atgaggccct gcccaacagg 
1801 gccaccgaga ggaccgtgga caagcccacc gagggggagg tgagcgccga cgaggagggc 
1861 tttgagaacc tgtgggccac cgcctccacc ttcatcgtcc tcttcctcct gagcctcttc 
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1921 tacagtacca ccgtcacctt gctcaaggtg aaatgatccc aacagaagaa catcggagac 
1981 cagagagagg aactcaaagg ggcgccgcct ccgggtctgg ggccccggcc tgcgcggccc 
2041 gttggcacgt gtttctcttc ccgcccggcc tccagttgtg tgctcccaca caggctcccc 
2101 tctcgaccgg caggggctgg ctggctcgca ggccacgagg tgggctctac cccacactgc 
2161 tttgccgtgc atacgcccgc cgccctgaaa caaatacgca cattttaccc atg 

{SEQUENCE ID NO* 61] 
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FIGURE 8D 



NUCLEOTIDE- AND AMINO ACID SEQUENCE OF PROTEIN 
CODING REGION OF pSHuSC 

1137 

GTCGATTCCCGGGTACC ATG GTG CTC TTC GTG CTC ACC TGC 
met val leu phe val leu thr cys 

1178/9 

CTG CTG GCG GTC TTC CCA GCC ATC TCC ACG AAG AGT CCC ATA TTT GGT CCC GAG GAG GTG 

leu leu ala val phe pro ala ile ser thr lys ser pro ile phe gly pro glu glu val 
123 8/29 

AAT AGT GTG GAA GGT AAC TCA GTG TCC ATC ACG TGC TAC TAC CCA CCC ACC TCT GTC AAC 

asn ser val glu gly asn ser val ser ile thr cys tyr tyr pro pro chr ser val asn 
1298/49 

CGG CAC ACC CGG AAG TAC TGG TGC CGG CAG GGA GCT AGA GGT GGC TGC ATA ACC CTC ATC 

arg his thr arg lys tyr trp cys arg gin gly ala arg gly gly cys ile thr leu ile 
1358/69 

TCC TCG GAG GGC TAC GTC TCC AGC AAA TAT GCA GGC AGG GCT AAC CTC ACC AAC TTC CCG 

ser ser glu gly tyr val ser ser lys tyr ala gly arg ala asn leu thr asn phe pro 
1418/89 

GAG AAC GGC ACA TTT GTG GTG AAC ATT "GCC CAG CTG AGC CAG GAT GAC TCC GGG CGC TAC 

glu asn gly thr phe val val asn ile ala gin leu ser gin asp asp ser gly arg tyr 
1478/109 

AAG TGT GGC CTG GGC ATC AAT AGC CGA GGC CTG TCC TTT GAT GTC AGC CTG GAG GTC AGC 

lys cys gly leu gly ile asn ser arg gly leu ser phe asp "val ser leu glu val ser 
1538/129 

CAG GGT CCT GGG CTC CTA AAT GAC ACT AAA GTC TAC ACA GTG GAC CTG GGC AGA ACG GTG 

gin gly pro gly leu leu asn asp chr lys val tyr thr val asp leu gly arg thr val 
1598/149 

ACC ATC AAC TGC CCT TTC AAG ACT GAG AAT GCT CAA AAG AGG AAG TCC TTG TAC AAG CAG 

thr ile asn cys pro phe lys thr glu asn ala gin lys arg lys ser leu tyr lys gin 
1658/169 

ATA GGC CTG TAC CCT GTG CTG GTC ATC GAC TCC AGT GGT TAT GTG AAT CCC AAC TAT ACA 

ile gly leu tyr pro val leu val ile asp ser ser gly tyr val asn pro asn tyr thr 
1718/189 

GGA AGA ATA CGC CTT GAT ATT CAG GGT ACT GGC CAG TTA CTG TTC AGC GTT GTC ATC AAC 

gly arg ile arg leu asp ile gin gly thr gly gin leu leu phe ser val val ile asn 
1778/209 

CAA CTC AGG . CTC AGC GAT GCT GGG CAG TAT CTC TGC CAG GCT GGG GAT GAT TCC AAT AGT 

gin leu arg leu ser asp ala gly gin tyr leu cys gin ala gly asp asp ser asn ser 
183 8/229 

AAT AAG AAG AAT GCT GAC CTC CAA GTG CTA AAG CCC GAG CCC GAG CTG GTT TAT GAA GAC 

asn lys lys asn ala asp leu gin val leu lys pro glu pro glu leu val tyr glu asp 
1898/249 

CTG AGG GGC TCA GTG ACC TTC CAC TGT GCC CTG GGC CCT GAG GTG GCA AAC GTG GCC AAA 

leu arg gly ser val thr phe his cys ala leu gly pro glu val ala asn val ala lys 
1958/269 

TTT CTG TGC CGA CAG AGC AGT GGG GAA AAC TGT GAC GTG GTC GTC AAC ACC CTG GGG AAG 

phe leu cys arg gin ser ser gly glu asn cys asp val val val asn thr leu gly lys 
2018/289 

AGG GCC CCA GCC TTT GAG GGC AGG ATC CTG CTC AAC CCC CAG GAC AAG GAT GGC TCA TTC 

arg ala pro ala phe glu gly arg ile leu leu asn pro gin asp lys asp gly ser phe 
2078/309 

AGT GTG GTG ATC ACA GGC CTG AGG AAG GAG GAT GCA GGG CGC TAC CTG TGT GGA GCC CAT 

ser val val ile thr gly leu arg lys glu asp ala gly arg tyr leu cys gly ala his 
2138/329 

TCG GAT GGT CAG CTG CAG GAA GGC TCG CCT ATC CAG GCC TGG CAA CTC TTC GTC AAT GAG 

ser asp gly gin leu gin glu gly ser pro ile gin ala trp gin leu phe val asn glu 
2198/349 

GAG TCC ACG ATT CCC CGC AGC CCC ACT GTG GTG AAG GGG GTG GCA GGA AGC TCT GTG GCC 

glu ser thr ile pro arg ser pro thr val val lys gly val ala gly ser ser val ala 
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2258/369 

GTG CTC TGC CCC TAC AAC CGT AAG GAA AGC AAA AGC ATC AAG TAC TGG TGT CTC TGG GAA 
val leu cys pro tyr asn arg lys glu ser lys ser ile lys tyr trp cys leu trp glu 
2318/389 

GGG GCC CAG AAT GGC CGC TGC CCC CTG CTG GTG GAC AGC GAG GGG TGG GTT AAG GCC CAG 
gly ala gin asn gly arg cys pro leu leu val asp ser glu gly trp val lys ala gin 
2378/409 

TAC GAG GGC CGC CTC TCC CTG CTG GAG GAG CCA GGC AAC GGC ACC TTC ACT GTC ATC CTC 
tyr glu gly arg leu ser leu leu glu glu pro gly asn gly thr phe thr val ile leu 
2438/429 " " 

AAC CAG CTC ACC AGC CGG GAC GCC GGC TTC TAC TGG TGT CTG ACC AAC GGC GAT ACT CTC 
asn gin leu thr ser arg asp ala gly phe tyr trp cys leu thr asn gly asp thr leu 
2498/449 

TGG AGG ACC ACC GTG GAG ATC AAG ATT ATC GAA GGA GAA CCA AAC CTC AAG GTT CCC GGG 
trp arg thr thr val glu ile lys ile ile glu gly glu pro asn leu lys val pro gly 
2SS8/469 

AAT GTC ACG GCT GTG CTG GGA GAG ACT CTC AAG GTC CCC TGT CAC TTT CCA TGC AAA TTC 
asn val thr ala val leu gly glu thr leu lys val pro cys his phe pro cys lys phe 
2618/489 

TCC TCG TAC GAG AAA TAC TGG TGC AAG TGG AAT AAC ACG GGC TGC CAG GCC CTG CCC AGC 
ser ser tyr glu lys tyr trp cys lys trp asn asn thr gly cys gin ala leu pro ser 
2678/509 

CAA GAC GAA GGC CCC AGC AAG GCC TTC GTG AAC TGT GAC GAG AAC AGC CGG CTT GTC TCC 
gin asp glu gly pro ser lys ala phe val asn cys asp glu asn ser arg leu val ser 
2738/529 

CTG ACC CTG AAC CTG GTG ACC AGG GCT GAT GAG GGC TGG TAC TGG TGT GGA GTG AAG CAG 
leu thr leu asn leu val thr arg ala asp glu gly trp tyr trp cys gly val lys gin 
2798/549 

GGC CAC TTC TAT GGA GAG ACT GCA GCC GTC TAT GTG GCA GTT GAA GAG AGG AAG GCA GCG 
gly his phe tyr gly glu thr ala ala val tyr val ala val glu glu arg lys ala ala 
2858/569 

GGG TCC CGC GAT GTC AGC CTA GCG AAG GCA GAC GCT GCT CCT GAT GAG AAG GTG CTA GAC 
gly ser arg asp val ser leu ala lys ala asp ala ala pro asp glu lys val leu asp 
2918/589 

TCT GGT TTT CGG GAG ATT GAG AAC AAA GCC ATT CAG GAT CCC AGG CTT TTT GCA GAG TGA 
ser gly phe arg glu ile glu asn lys ala ile gin asp pro arg leu phe ala glu 
2978 
- A TTC 

[SEQUENCE ID NO:12] 
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SEQUENCE LISTING 



<110> PliANET BIOTECHNOLOGY, INCORPORATED 
Larrick, James William 
Wycoff, Keith Lynn 

<120> NOVEL I MMUNO ADHESION FOR THE PREVENTION 
OF RHINOVIRUS INFECTION 



<130> 030905 . 0003 . WO 

<140> To be assigned 
<141> 2001-04-27 

<150> 60/200,298 
<151> 2000-04-27 



<170> FastSEQ for Windows Version 4.0 

<210> 1 

<211> 1596 

<212> DNA 

<213> Homo Sapien 

<220> 

<221> CDS 

<222> (1) . . . <1596) 

<223> ICAM-1 

<400> 1 

atg get ccc age age ccc egg ccc gcg ctg ccc gca etc ctg gtc ctg 48 

Met Ala Pro Ser Ser Pro Arg Pre Ala Leu Pro Ala Leu Leu Val Leu 

1 5 10 15 

etc ggg get ctg ttc cca gga cct ggc aat gee cag aca tct gtg tec 96 
Leu Gly Ala Leu Phe Pro Gly Pro Gly Asn Ala Gin Thr Ser Val Ser 
20 25 ■ 30 

ccc tea aaa gtc ate ctg ccc egg gga ggc tec gtg ctg gtg aca tgc 144 
Pro Ser Lys Val He Leu Pro Arg Gly Gly Ser Val Leu Val Thr Cys 
35 40 4S 

age ace tec tgt gac cag ccc aag ttg ttg ggc ata gag acc ccg ttg 192 
Ser Thr Ser Cys Asp Gin Pro Lys Leu Leu Gly He Glu Thr Pro Leu 
50 55 60 

cct aaa aag gag ttg etc ctg cct ggg aac aac egg aag gtg tat gaa 240 
Pro Lys Lys Glu Leu Leu Leu ■ Pro Gly Asn Asn Arg Lys Val Tyr Glu 
65 -70 75 80 
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FIGURE 9 (ConL) 

ctg age aat gtg caa gaa gat age caa cca atg tgc tat tea aac tgc 288 

Leu Ser Asn Val Gin Glu Asp Ser Gin Pro Met Cys Tyr Ser Asn Cys 

85 90 95 

cct gat ggg cag tea aca get aaa ace ttc etc ace gtg tac tgg act 336 

Pro Asp Gly Gin Ser Thr Ala Lys Thr Phe Leu Thr Val Tyr Trp Thr 

100 10S 110 

cca gaa egg gtg gaa ctg gca ccc etc ccc tct tgg cag cca gtg ggc 3 84 

Pro Glu Arg Val Glu Leu Ala Pro Leu Pro Ser Trp Gin Pro Val Gly 

115 120 125 

aag aac ctt ace eta cgc tgc cag gtg gag ggt ggg gca ccc egg gee 432 

Lys Asn Leu Thr Leu Arg Cys Gin Val Glu Gly Gly Ala Pro Arg Ala 

130 135 140 

aac etc acc gtg gtg ctg etc cgt ggg gag aag gag ctg aaa egg gag 4 80 

Asn Leu Thr Val Val Leu Leu Arg Gly Glu Lys Glu Leu Lys Arg Glu 

145 150 ~ 155 160 

cca get gtg ggg gag ccc get gag gtc acg acc acg gtg ctg gtg agg 528 

Pro Ala Val Gly Glu Pro Ala Glu Val Thr Thr Thr Val Leu Val Arg 

165 170 175 

aga gat cac cat gga gee aat ttc teg tgc cgc act gaa ctg gac ctg 576 

Arg Asp His His Gly Ala Asn Phe Ser Cys Arg Thr Glu Leu Asp Leu 

180 185 190 

egg ccc caa ggg ctg gag ctg ttt gag aac acc teg gee ccc tac cag 624 

Arg Pro Gin Gly Leu Glu Leu Phe Glu Asn Thr Ser Ala Pro Tyr Gin 

195 200 205 

etc cag acc ttt gtc ctg cca gcg act ccc cca caa ctt gtc age ccc 672 

Leu Gin Thr Phe Val Leu Pro Ala Thr Pro Pro Gin Leu Val Ser Pro 

210 215 220 

egg gtc eta gag gtg gac acg cag ggg acc gtg gtc tgt tec ctg gac 720 

Arg Val Leu Glu Val Asp Thr Gin Gly Thr Val Val Cys Ser Leu Asp 

225 230 235 240 

ggg c tg ttc cca gtc teg gag gee cag gtc cac ctg gca ctg ggg gac 768 

Gly Leu Phe Pro Val Ser Glu Ala Gin Val His Leu Ala Leu Gly Asp 

245- 250 255 

cag agg ttg aac ccc aca gtc acc tat ggc aac gac tec ttc teg gee 816 

Gin Arg Leu Asn Pro Thr Val Thr Tyr Gly Asn Asp Ser Phe Ser Ala 

260 ' 265 270 

aag gec tea gtc agt gtg acc gca gag gac gag ggc acc cag egg ctg 864 

Lys Ala Ser Val Ser Val Thr Ala Glu Asp Glu Gly Thr Gin Arg Leu 

275 280 28S 

acg tgt gca gta ata ctg ggg aac cag age cag gag aca ctg cag aca 912 

Thr Cys Ala Val lie Leu Gly. Asn Gin Ser Gin Glu Thr Leu Gin Thr 

290 295 300 

gtg acc ate tac age ttt ccg gcg ccc aac gtg att ctg acg aag cca . 960 

Val Thr lie Tyr Ser Phe Pro Ala Pro Asn Val lie Leu Thr Lys Pro 

305 " 310 315 320 
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FIGURE 9 (Cont.) 



gag gtc tea gaa ggg acc gag gtg aca gtg aag tgt gag gec cac cct 1008 

Glu Val Ser Glu Gly Thr Glu Val Thr Val Lys Cys Glu Ala His Pro 
32S 330 335 

aga gec aag gtg acg ctg aat ggg gtt cca gec cag cca ctg ggc ccg 1056 

Arg Ala Lys Val Thr Leu Asn Gly Val Pro Ala Gin Pro Leu Gly Pro 

340 345 350 

agg gec cag etc ctg ctg aag gec acc cca gag gac aac ggg cgc age 1104 

Arg Ala Gin Leu Leu Leu Lys Ala Thr Pro Glu Asp Asn Gly Arg Ser 

355 360 365 

ttctcc tgc tct gca acc ctg gag gtg gee ggc cag ctt ata cac aag 1152 

Phe Ser Cys Ser Ala Thr Leu Glu Val Ala Gly Gin Leu He His Lys 

370 375 380 

aac cag acc egg gag ctt cgt gtc ctg tat ggc ccc cga ctg gac gag 1200 

Asn Gin Thr Arg Glu Leu Arg Val Leu Tyr Gly Pro Arg Leu Asp Glu 

385 390 395 400 

agg gat tgt ccg gga aac tgg acg tgg cca gaa aat tec cag cag act 1248 

Arg Asp Cys Pro Gly Asn Trp Thr Trp Pro Glu Asn Ser Gin Gin Thr 
405 410 415 

cca atg tgc cag get tgg ggg aac cca ttg ccc gag etc aag tgt eta 1296 

Pro Met Cys Gin Ala Trp Gly Asn Pro Leu Pro Glu Leu Lys Cys Leu 

420 425 430 

aag gat ggc act ttc cca ctg ccc ate ggg gaa tea gtg act gtc act 1344 

Lys Asp Gly Thr Phe Pro Leu Pro He Gly Glu Ser Val Thr Val Thr 

435 440 445 

cga gat ctt gag ggc acc tac etc tgt egg gee agg age act caa ggg 1392 

Arg Asp Leu Glu Gly Thr Tyr Leu Cys Arg Ala Arg Ser Thr Gin Gly 

450 455 460 

gag gtc acc cgc aag gtg acc gtg aat gtg etc tec ccc egg tat gag 1440 

Glu Val Thr Arg Lys Val Thr Val Asn Val Leu Ser Pro. Arg Tyr Glu 

465 470 475 480 

att gtc ate ate act gtg gta gca gec gca gtc ata atg ggc act gca 1488 

He Val He He Thr Val Val Ala Ala Ala Val He Met Gly Thr Ala 
485 490 495 

ggc etc age acg tac etc tat aac cgc cag egg aag ate aag aaa tac 153 6 

Gly Leu Ser Thr Tyr Leu Tyr Asn Arg Gin Arg Lys He Lys Lys Tyr 

500 505 510 

aga eta caa cag gec caa aaa ggg acc ccc atg aaa ccg aac aca caa 1584 

Arg Leu Gin Gin Ala Gin Lys Gly Thr Pro Met Lys Pro Asn Thr Gin 

SIS 520 525 

gee acg cct ccc 1596 
Ala Thr Pro Pro 
530 



[DNA Sequence is SEQUENCE ID NO:l] 
[Protein Sequence is SEQUENCE ID NO:2] 



WO 03/064992 



66/91 



PCT/US02/34197 



FIGURE 9 (Cont.) 

<210> 2 

<211> 532 

<212> PRT 

<213> Homo Sapien 



<400> 2 



Met 


Ala 


Pro 


Ser 


Ser 


Pro 


Arg 


Pro Ala 


Leu Pro Ala 


Leu 


Leu 


Val Leu 


1 








5 








10 






15 


Leu 


Gly 


Ala 


Leu 


Phe 


Pro 


Gly 


Pro Gly 


Asn Ala Gin 


Thr 


Ser 


Val Ser 








20 








25 






30 




Pro 


Ser 


Lys 


Val 


He 


Leu 


Pro 


Arg Gly 


Gly Ser Val 


Leu 


Val 


Thr Cys 






35 










40 




45 






Ser 


Thr 


Ser 


Cys 


Asp 


Gin 


Pro 


Lys Leu 


Leu Gly He 


Glu 


Thr 


Pro Leu 




50 










55 




60 








Pro 


Lys 


Lys 


Glu 


Leu 


Leu 


Leu 


Pro Gly 


Asn Asn Arg 


Lys 


Val 


Tyr Glu 


65 










70 






75 






80 


Leu 


Ser 


Asn 


Val 


Gin 


Glu 


Asp 


Ser Gin 


Pro Met Cys 


Tyr 


Ser 


Asn Cys 










85 








90 






9S 


Pro 


Asp 


Gly 


Gin 


Ser 


Thr 


Ala 


Lys Thr 


Phe Leu Thr 


Val 


Tyr 


Trp Thr 








100 








105 






110 




Pro 


Glu 


Arg 


Val 


Glu 


Leu 


Ala 


Pro Leu 


Pro Ser Trp 


Gin 


Pro 


Val Gly 






115 










120 




125 






Lys 


Asn 


Leu 


Thr 


Leu Arg 


Cys 


Gin Val 


Glu Gly Gly Ala 


Pro 


Arg Ala 




130 










135 




140 








Asn 


Leu 


Thr 


Val 


Val 


Leu 


Leu Arg Gly 


Glu Lys Glu 


Leu 


Lys 


Arg Glu 


145 










150 






155 






160 


Pro 


Ala 


Val 


Gly 


Glu 


Pro 


Ala 


Glu Val 


Thr Thr Thr 


Val 


Leu 


Val Arg 










165 








170 






175 


Arg 


Asp 


His 


His 


Gly Ala 


Asn 


Phe Ser 


Cys Arg Thr 


Glu 


Leu 


Asp Leu 








180 








185 






190 




Arg 


Pro 


Gin 


Gly 


Leu 


Glu 


Leu 


Phe Glu 


Asn Thr Ser 


Ala 


Pro 


Tyr Gin 






195 










200 




205 






Leu 


Gin 


Thr 


Phe 


Val 


Leu 


Pro 


Ala Thr 


Pro Pro Gin 


Leu 


Val 


Ser Pro 




210 










215 




220 








Arg 


Val 


Leu 


Glu 


Val 


Asp 


Thr Gin Gly 


Thr Val Val 


Cys 


Ser 


Leu Asp 


225 










230 






235 






240 


Gly 


Leu 


Phe 


Pro 


Val 


Ser 


Glu 


Ala Gin 


Val His Leu 


Ala 


Leu 


Gly Asp 










245 








250 






2 5S 


Gin 


Arg 


Leu 


Asn 


Pro 


Thr 


Val 


Thr Tyr 


Gly Asn Asp 


Ser 


Phe 


Ser Ala 








260 








265 






270 




Lys 


Ala 


Ser 


Val 


Ser 


Val 


Thr 


Ala Glu 


Asp Glu Gly Thr Gin Arg Leu 






275 










280 




285 






Thr 


Cys 


Ala 


Val 


He 


Leu 


Gly Asn Gin 


Ser Gin Glu 


Thr 


Leu 


Gin Thr 




290 










295 




300 








Val 


Thr 


He 


Tyr 


Ser 


Phe 


Pro 


Ala Pro 


Asn Val He 


Leu 


Thr 


Lys Pro 


305 










310 






315 






320 


Glu 


Val 


Ser 


Glu 


Gly Thr 


Glu 


Val Thr 


Val Lys Cys 


Glu 


Ala 


His Pro 










325 








330 






335 


Arg 


Ala 


Lys 


Val 


Thr 


Leu 


Asn Gly Val 


Pro Ala Gin 


Pro 


Leu Gly Pro 








340 








345 






350 




Arg 


Ala 


Gin 


Leu 


Leu 


Leu 


Lys 


Ala Thr 


Pro Glu Asp 


Asn 


Gly Arg Ser 






355 










360 




365 






Phe 


Ser 


Cys 


Ser 


Ala 


Thr 


Leu 


Glu Val 


Ala Gly Gin 


Leu 


He 


His Lys 




370 










375 




380 








Asn 


Gin 


Thr 


Arg 


Glu 


Leu 


Arg Val Leu 


Tyr Gly Pro 


Arg 


Leu 


Asp Glu 


385 










390 






395 • 




400 


Arg 


Asp 


Cys 


Pro 


Gly Asn 


Trp Thr Trp 


Pro Glu Asn 


Ser 


Gin 


Gin Thr 










405 








410 






415 
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FIGURE 9 (Cont.) 



Pro 


Met 


Cys 


Gin 
420 


Ala 


Trp 


Gly Asn 


Pro 
425 


Leu 


Pro 


Glu 


Leu 


Lys 
430 


Cys Leu 


Lys 


ASp 


Gly 
435 


Thr 


Phe 


Pro 


Leu Pro 
440 


lie 


Gly 


Glu 


Ser 


Val 
445 


Thr 


Val Thr 


Arg 


Asp 
450 


Leu 


Glu 


Gly 


Thr 


Tyr Leu 
455 


Cys 


Arg 


Ala 


Arg 
460 


Ser 


Thr 


Gin Gly 


Glu 


Val 


Thr 


Arg 


Lys 


Val 


Thr Val 


Asn 


Val 


Leu 


Ser 


Pro 


Arg 


Tyr Glu 


465 










470 








475 








480 


lie 


Val 


He 


He 


Thr 


Val 


Val Ala 


Ala 


Ala 


Val 


He 


Met 


Gly Thr Ala 










485 








490 










495 


Gly 


Leu 


Ser 


Thr 


Tyr 


Leu Tyr Asn 


Arg 


Gin Arg 


Lys 


He 


Lys 


Lys Tyr 








500 








505 










510 




Arg 


Leu 


Gin 


Gin 


Ala 


Gin 


Lys Gly Thr 


Pro 


Met 


Lys 


Pro 


Asn 


Thr Gin 






515 








520 










525 






Ala 


Thr 
530 


Pro 


Pro 























[SEQUENCE ID NO:2] 



<210> 3 

<211> 3003 

<212> DNA 

<213> Homo Sapien 



<400> 3 

gctataagga 

tcagcctcgc 

tcggggctct 

tcctgccccg 

tgttgggcat 

aggtgtatga 

ctgatgggca 

aactggcacc 

tggagggtgg 

tgaaacggga 

gagatcacca 

tggagctgtt 

ctcccccaca 

gttccctgga 

agaggttgaa 

gtgtgaccgc 

agagccagga 

tgacgaagcc 

gagccaaggt 

tgctgaaggc 

tggccggcca 

gactggacga 

caatgtgcca 

tcccactgcc 

gtcgggccag 

cccggtatga 

gcctcagcac 

cccaaaaagg 

ggacagggcc 

tggaagacat 

tgtcctcagt 

gccacgcatc 

atgattgatg 

'cccaccatga 



tcacgcgccc 
tatggctccc 
gttcccagga 
gggaggctcc 
agagaccccg 
actgagcaat 
gtcaacagct 
cctcccctct 
ggcaccccgg 
gccagctgtg 
tggagccaat 
tgagaacacc 
acttgtcagc 
cgggctgttc 
ccccacagtc 
agaggacgag 
gacactgcag 
agaggtctca 
gacgctgaat 
caccccagag 
gcttatacac 
gagggattgt 
ggcttggggg 
catcggggaa 
gagcactcaa 
gattgtcatc 
gtacctctat 
gacccccatg 
tcttcctcgg 
atgccatgca 
cagatacaac 
tgatctgtag 
gatgttaaag 
ggacatacaa 



cagtcgacgc 
agcagccccc 
cctggcaatg 
gtgctggtga 
ttgcctaaaa 
gtgcaagaag 
aaaaccttcc 
tggcagccag 
gccaacctca 
ggggagcccg 
ttctcgtgcc 
tcggccccct 
ccccgggtcc 
ccagtctcgg 
acctatggca 
ggcacccagc 
acagtgacca 
gaagggaccg 
ggggttccag 
gacaacgggc 
aagaaccaga 
ccgggaaact 
aacccattgc 
tcagtgactg 
ggggaggtca 
atcactgtgg 
aaccgccagc 
aaaccgaaca 
ccttcccata 
gctacaccta 
agcatttggg 
tcacatgact 
tctagcctga 
ctgggaaata 



tgagctcctc 
ggcccgcgct 
cccagacatc 
catgcagcac 
aggagttgct 
atagccaacc 
tcaccgtgta 
tgggcaagaa 
ccgtggtgct 
ctgaggtcac 
gcactgaact 
accagctcca 
tagaggtgga 
aggcccaggt 
acgactcctt 
ggctgacgtg 
tctacagctt 
aggtgacagt 
cccagccact 
gcagcttctc 
cccgggagct 
ggacgtggcc 
ccgagctcaa 
tcactcgaga 
cccgcaaggt 
tagcagccgc 
ggaagatcaa 
cacaagccac 
ttggtggcag 
ccggccctgg 
gccatggtac 
aagccaagag 
tgagagggga 
ctgaaacttg 



tgctactcag 
gcccgcactc 
tgtgtccccc 
ctcctgtgac 
cctgcctggg 
aatgtgctat 
ctggactcca 
ccttacccta 
gctccgtggg 
gaccacggtg 
ggacctgcgg 
gacctttgtc 
cacgcagggg 
ccacctggca 
ctcggccaag 
tgcagtaata 
tccggcgccc 
gaagtgtgag 
gggcccgagg 
ctgctctgca 
tcgtgtcctg 
agaaaattcc 
gtgtctaaag 
tcttgagggc 
gaccgtgaat 
agtcataacg 
gaaatacaga 
gcctccctga 
tggtgccaca 
gacgccggag 
ctgcacacct 
gaaggagcaa 
agtggtgggg 
ctgcctattg 



agttgcaacc 
ctggtcctgc 
tcaaaagtca 
cagcccaagt 
aacaaccgga 
tcaaactgcc 
gaacgggtgg 
cgctgccagg 
gagaaggagc 
ctggtgagga 
ccccaagggc 
ctgccagcga 
accgtggtct 
ctgggggacc 
gcctcagtca 
ctggggaacc 
aacgtgattc 
gcccacccta 
gcecagctcc 
accctggagg 
tatggccccc 
cagcagactc 
gatggcactt 
acctacctct 
gtgctctccc 
ggcactgcag 
ctacaacagg 
acctatcccg 
ctgaacagag 
gacagggcat 
aaaacactag 
gactcaagac 
gagacatagc 
ggtatgctga 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
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FIGURE 9 (Cont.) 



ggccccacag 
aggcccacac 
gatgatatgt 
ggctaaatga 
caccaggtac 
ggggctggga 
gcacaaaagc 
cttattcctc 
atttctgcca 
atatgcccaa 
attgcagctc 
ggtattggag 
atgtgtagac 
cactgcagtc 
gggaccatag 
acggggtctc 
gcc 



acttacagaa 
ttcctgacgg 
atttattcat 
acataggtct 
agttgtacag 
cttctcattg 
actatatgga 
ccttcccccc 
gtgttcacaa 
gctatgcctt 
cagtttcctg 
gactccctcc 
aagctctcgc 
ttgacctttt 
gctcacaaca 
gcaacattgc 



gaagtggccc 
atgccagctt 
ttgttatttt 
ctggcctcac 
gttgtacact 
gccaacctgc 
ctggtaatgg 
aaaactgaca 
tgaca.ctcag 
gtcctcttgt 
cagcgatcag 
cagctttgga 
tctgtcaccc 
gggctcaagt 
ccacacctgg 
ccagacttcc 



tccatagaca 
gggcactgct 
accagctatt 
ggagctccca 
gcaggagagt 
ctttccccag 
ttcacaggtt 
cctttgttag 
cggtcatgtc 
cctgtttgca 
ggtcctgcaa 
agcctcatcc 
aggctggagt 
gatcctccca 
caaatttgat 
tttgcgttag 



<210> 4 

<211> 6 

<212> PRT 

<213> Homo Sapien 

<400> 4 

Ser Glu Lys Asp Glu Leu 
1 5 



<210> 5 

<211> 7 

<212> PRT 

<213> Homo Sapien 

<400> 5 

Arg Ser Glu Lys Asp Glu Leu 
1 S 



tgtgtagcat 
gtctactgac 
tattgagtgt 
gtccatgtca 
gccc:ggcaaa 
aaggagtgat 
cagagattac 
ccacctcccc 
tggacatgag 
tttcactggg 
gcagtgggga 
gcgtgcgtgt 
gcagtggcgc 
cctcagcctc 
tttttttttt 
ttaataaagc 



caaaacacaa 
cccaaccctt 
cttttatgta 
catCcaaggt 
aagatcaaat 
ttttctatcg 
ccagtgaggc 
acccacatac 
tgcccaggga 
agcttgcact 
a 9999gccaa 
gtgtgtgtgt 
aatcatggtt 
ctgagtagct 
tttttcagag 
ttcctcaact 



[SEQUENCE ID 



2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
25S0 
2640 
2700 
2760 
2820 
2B80 
2940 
3000 
3003 

NO:3] 



[SEQUENCE IB NO:4] 



[SEQUENCE ID NO:5} 



<210> 6 

<211> 52 

<212> DNA 

<213> Homo Sapien 

<400> 6 

tctgttccca ggaactagtt tggcacagac atctgtgtcc ccctcaaaag tc 52 

[SEQUENCE ID NO:6] 

<210> 7 

<211> 38 

<212> DNA 

<213> Homo Sapien 



<400> 7 

cacaccgggg actagtcaca ttcacggtca cctcgcgg 



[SEQUENCE ID NO:7] 
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FIGURE 9 (Cont.) 

<210> 8 

<211> 799 

<212> PRT 

<213> Homo Sapien 

<220> 

<221> CDS 

<222> (1) . . . (448) 

<22 3> ICAM-1 Extracellular Domains 

<220> 

<221> CDS 

<222> (453) . . . (799) 

<223> Human IgA2m(2) 



<400> 8 



Gin 


Thr 


Ser 


Val 


Ser 


Pro 


Ser 


Lys 


Val 


lie Leu 


Pro Arg 


Gly Gly Ser 


1 








5 










10 






15 


Val 


Leu 


Val 


Thr 


Cys 


Ser 


Thr 


Ser 


Cys 


Asp Gin 


Pro 


Lys 


Leu Leu Gly 








20 










25 








30 


lie 


Glu 


Thr 


Pro 


Leu 


Pro 


Lys 


Lys 


Glu 


Leu Leu 


Leu 


Pro 


Gly Asn Asn 






35 










40 








45 




Arg 


Lys 


Val 


Tyr 


Glu 


Leu 


Ser 


Asn 


Val 


Gin Glu 


Asp 


Ser 


Gin Pro Met 




50 










55 








60 






Cys 


Tyr 


Ser 


Asn 


Cys 


Pro 


Asp 


Gly 


Gin 


Ser Thr 


Ala 


Lys 


Thr Phe Leu 


65 










70 








75 






80 


Thr 


Val 


Tyr 


Trp 


Thr 


Pro 


Glu 


Arg 


Val 


Glu Leu 


Ala 


Pro 


Leu Pro Ser 










85 










90 






95 


Trp 


Gin 


Pro 


Val 


Gly 


Lys 


Asn 


Leu 


Thr 


Leu Arg 


Cys 


Gin 


Val Glu Gly 








100 










105 








110 


Gly 


Ala 


Pro 


Arg 


Ala 


Asn 


Leu 


Thr 


Val 


Val Leu 


Leu 


Arg 


Gly Glu Lys 






115 










120 








125 




Glu 


Leu 


Lys 


Arg 


Glu 


Pro 


Ala 


Val 


Gly 


Glu Pro 


Ala 


Glu 


Val Thr Thr 




130 










135 








140 






Thr 


Val 


Leu 


Val 


Arg 


Arg 


Asp 


His 


His 


Gly Ala Asn 


Phe 


Ser Cys Arg 


145 










150 








155 






160 


Thr 


Glu 


Leu 


Asp 


Leu 


Arg 


Pro 


Gin 


Gly 


Leu Glu 


Leu 


Phe 


Glu Asn Thr 










165 










170 






175 


Ser 


Ala 


Pro 


Tyr 


Gin 


Leu 


Gin 


Thr 


Phe 


Val Leu 


Pro 


Ala 


Thr Pro Pro 








180 










185 








190 


Gin 


Leu 


Val 


Ser 


Pro 


Arg 


Val 


Leu 


Glu 


Val Asp Thr Gin Gly Thr Val 






195 










200 








205 




Val 


Cys 


Ser 


Leu 


Asp 


Gly 


Leu 


Phe 


Pro 


Val Ser 


Glu 


Ala 


Gin Val His 




210 










215 








220 






Leu 


Ala 


Leu 


Gly 


Asp 


Gin 


Arg 


Leu 


Asn 


Pro Thr 


Val 


Thr Tyr Gly Asn 


225 










230 








235 






240 


Asp 


Ser 


Phe 


Ser 


Ala 


Lys 


Ala 


Ser 


Val 


Ser Val Thr Ala Glu Asp Glu 










245 










250 






255 


Gly 


Thr 


Gin 


Arg 


Leu .Thr 


Cys 


Ala 


Val 


He Leu Gly Asn Gin Ser Gin 








260 










265 








270 


Glu 


Thr 


Leu 


Gin 


Thr 


Val 


Thr 


lie 


Tyr 


Ser Phe 


Pro 


Ala 


Pro Asn Val 






275 










280 








285 




lie 


Leu 


Thr 


Lys 


Pro 


Glu 


Val 


Ser 


Glu 


Gly Thr 


Glu 


Val 


Thr Val Lys 




290 










295 








300 






Cys 


Glu 


Ala 


His 


Pro 


Arg 


Ala 


Lys 


Val 


Thr Leu Asn Gly Val Pro Ala 


305 










310 








315 






320 


Gin 


Pro 


Leu 


Gly 


Pro 


Arg 


Ala 


Gin 


Leu 


Leu Leu 


Lys 


Ala 


Thr Pro Glu 










325 










330 






335 


Asp 


Asn 


Gly 


Arg 


Ser 


Phe 


Ser 


Cys 


Ser 


Ala Thr 


Leu 


Glu 


Val Ala Gly 



340 345 350 
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FIGURE 9 (Cont) 



Gin 


Leu 


He 


His 


Lys 


Asn 


Gin 


Thr 




Glu 


Leu 


Arg 


Val 


Leu Tyr Gly 






355 










360 










365 






Pro 


Arc 
3 7 0 


Leu 


AS"D 


Glu 


Arg 


Asp 
375 


Cys 


Pro 


Gly 


Asn 


Trp 
380 


Thr 


Trp 


Pro Glu 


Asn 


Ser 


Gin 


Gin 


Thr 


Pro 


Met 


Cys 


Gin 


Ala 


Trp 


Gly 


Asn 


Pro 


Leu Pro 


385 










390 










395 








400 


Glu 




Lys 


Cys 


Leu 


Lys 


Asp 


Gly 


Thr 


Phe 


Pro 


Leu 


Pro 


He Gly Glu 










405 










410 










415 


Ser 


Val 


Thr 


Val 


Thr 


Arg 


Asp 


Leu 


Glu 


Gly Thr Tyr 


Leu 


Cys Arg Ala 








420 










425 










430 




Arg 


Ser 


Thr 


Gin 


Gly Glu Val 


Thr 


Arg 


Glu 


Val 


Thr 


Val 


Asn 


Val Thr 






4 35 










440 










445 






Ser 


Gly 
450 


Ser 


Ser 


Ala 


Ser 


Pro 
455 


Thr 


Ser 


Pro 


Lys 


val 
460 


Phe 


Pro 


Leu Ser 


Leu 


Hen 
ASp 


Ser 


Thr 


Pro Gin Asp Gly 


Asn 


Val 


Val 


Val 


Ala 


Cys 


Leu Val 


465 










470 










475 








480 


Gin 


Gly 


Phe 


Phe 


Pro 
485 


Gin 


Glu 


Pro 


Leu 


Ser 
490 


Val 


Thr 


Trp 


Ser 


Glu Ser 
495 


Gly 


Gin 


Asn 


Val 


Thr 


Ala 


Arg 


Asn 


Phe 


Pro 


Pro 


Ser 


Gin Asp Ala Ser 








500 










5 0 5 










510 






Asp 


Leu 
515 


Tyr 


Thr 


Thr 


Ser 


Ser 
520 


Gin 


Leu 


Thr 


Leu 


Pro 
52S 


Ala 


Thr Gin 


Cys 


Pro 


Asp 


Gly 


Lys 


Ser 


Val 


Thr 


Cys 


His 


Val 


Lys 


His 


Tyr Thr Asn 




530 










535 










540 








Ser 


Ser 


Gin 


Asp 


Val 


Thr 


Val 


Pro 


Cys 


Arg 


Val 


Pro 


Pro 


Pro 


Pro Pro 


545 










550 










555 








560 


Cys 


Cys 


His 


Pro 


Arg 
565 


Leu 


Ser 


Leu 


His 


Arg 
570 


Pro 


Ala 


Leu 


Glu 


Asp Leu 
575 


Leu 


Leu 


Gly 


Ser 


Glu 


Ala 


Asn 


Leu 


Thr 


Cys 


Thr Leu Thr Gly Leu Arg 








580 










585 










590 




Asp 


Ala 


Ser 


Gly 


Ala 


Thr 


Phe 


Thr 


Trp 


Thr 


Pro 


Ser 


Ser Gly Lys Ser 






595 










600 










605 






Ala 


Val 


Gin 


Gly 


Pro 


Pro 


Glu 


Arg 


Asp 


Leu 


Cys Gly 


Cys 


Tyr 


Ser Val 




610 










61S 










620 








Ser 


Arg 


Val 


Leu 


Pro Gly 


Cys 


Ala 


Gin 


Pro Trp Asn His Gly Glu Thr 


625 










630 










635 








640 


Phe 


Thr 


Cys 


Thr 


Ala 
645 


Ala 


His 


Pro 


Glu 


Leu 
650 


Lys 


Thr 


Pro 


Leu 


Thr Ala 
655 


Asn 


He 


Thr 


Lys 


Ser Gly Asn Thr 


Phe 


Arg 


Pro 


Glu 


Val 


His 


Leu Leu 








660 










665 










670 




Pro 


Pro 


Pro 
675 


Ser 


Glu 


Glu 


Leu 


Ala 
660 


Leu 


Asn 


Glu 


Leu 


Val 
685 


Thr 


Leu Thr 


Cys 


Leu 
690 


Ala 


Arg 


Gly 


Phe 


Ser 
695 


Pro 


Lys 


Asp 


Val 


Leu 
700 


Val 


Arg 


Trp Leu 


Gin 


Gly 


Ser 


Gin 


Glu 


Leu 


Pro 


Arg 


Glu 


Lys 


Tyr 


Leu 


Thr 


Trp 


Ala Ser 


705 










710 










715 








720 


Arg 


Gin 


Glu 


Pro 


Ser 


Gin 


Gly Thr 


Thr 


Thr 


Tyr 


Ala 


Val 


Thr 


Ser He 










725 










730 










735 


Leu 


Arg 


Val 


Ala 


Ala 


Glu 


Asp 


Trp 


Lys 


Lys 


Gly Glu 


Thr 


Phe 


Ser Cys 








740 










745 










7S0 




Met 


Val 


Gly 
755 


His 


Glu 


Ala 


Leu 


Pro 

760 


Leu 


Ala 


Phe 


Thr 


Gin 
765 


Lys 


Thr He 


Asp 


Arg 


Leu 


Ala 


Gly Lys 


Pro 


Thr 


His 


He 


Asn 


Val 


Ser 


Val 


Val Met 




770 










77 S. 










780 








Ala 


Glu 


Ala 


Asp 


Gly Thr 


Cys 


Tyr 


Arg 


Ser 


Glu 


Lys Asp 


Glu 


Leu 


785 










790 










795 
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Figure 10: 

1/1 

GGT ACC ACT TCT CTC AAT CCA ACT TTC TAA 
5 61/10 

CTT TCT TTG TCT TTG CTT TTG TTC ACC TCT 

LSLSLLLFTS 

121/30 

GAC AAA TCA GGA AGT GTG CTG CAC CAC TGG 
10 DKSGSVLHHW 
181/50 

GCT CAC AAA TTC ATC AGC CCA CAG TTG AGA 

AHKFISPQLR 

241/70 

15 ACA ACC TTA ATG AAA CTG ACA GAA GAC AGA 
TTLMKLTEDR 
301/90 

CAG AAA GTT CTG CCA GGA GGA GAC ACT TAC 
QKVLPGGDTY 
20 361/110 

CAG ATT TAT TAT GAA AAC AGA CAA GGG TAC 
QI YYENRQ .GY 
421/130 

GAT GGA GAA CTC CAT GAA GAT CTC TTT TTC 
25 DGELHEDLFF 



ACA ATG GCT TCT AAA CCT TTC TTG TCT CTT 
MASKPFLSL 

ACT AGT TTG GCT GAC CTG TAC TTC ATT TTG 
TSLA DLYF I L 

AAT GAA ATC TAT TAC TTT GTG GAA CAG TTG 
NEIYYFVEQL 

ATG TCC TTT ATT GTT TTC TCC ACC CGA GGA 
MSFIVFSTRG 

GAA CAA ATC CGT CAA GGC CTA GAA GAA CTC 
EQIRQGLEEL 

ATG CAT GAA GGA TTT GAA AGG GCC AGT GAG 
MHEGFERAS ; E 

AGG ACA GCC AGC GTC ATC ATT GCT TTG ACT 
RTASVI IALT 

TAT TCA GAG AGG GAG GCT AAT AGG TCT CGA 
YSEREANRSR 
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481/150 

GAT CTT GGT GCA ATT GTT TAC TGT GTT GGT GTG AAA GAT TTC AAT GAG ACA GAG CTG GCC 

DLGAIVYCVGVKDFNETQLA 

541/170 

5 CGG ATT GCG GAC AGT AAG GAT CAT GTG TTT CCC GTG AAT GAC GGC TTT GAG GCT CTG CAA 
RIADS KDHVFPVNDGFQALQ 
601/190 

GGC ATC ATC CAC TCA ATT TTG AGC TCT GCT TCC CCA ACC AGC CCT AAG GTC TTC CCT CTC 
G I I HS I LS SAS P TS P KV FPL 
10 661/210 

AGC CTT GAC AGC ACC CCT CAA GAT GGT AAT GTT GTC GTT GCT TGC CTT GTC CAG GGT TTC 

SLDSTPQDGNVVVACLVQGF 

721/230 

TTC CCT CAG GAG CCA CTC TCT GTT ACC TGG TCT GAA TCT GGA CAG AAT GTT ACC GCC AGA 
15 FPQEPL SVTWSESGQNVTAR 
781/250 

AAC TTC CCA CCT AGC CAG GAT GCC TCC GGT GAC CTC TAC ACC ACC AGC TCT CAG CTC ACC 

NFPPSQDASGDLYTTSSQLT 

841/270 

20 CTT CCA GCC ACC CAG TGC CCA GAT GGT AAG TCC GTT ACC TGC CAT GTT AAG CAC TAC ACC 
LPATQC PDGKSVTCHVKHYT 
901/290 

AAC TCC AGC CAG GAT GTT ACT GTT CCA TGC CGT GTT CCA CCA CCT CCA CCA TGC TGC CAC 
NSSQDVTVPCRVPPPPPCCH 
25 961/310 
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CCA CGT CTC TCT CTT CAC CGT CCT GCC CTT GAG GAC TTG CTC TTG GGT TCT GAA GCT AAC 
PRLSLHR PALEDLLLGS E A N 
1021/330 

CTC ACC TGC ACC CTC ACC GGT CTC AGA GAT GCC TCT GGT GCC ACC TTC ACC TGG ACC CCA 
5 LTCTLTGLRDASGATFTWTP 
1081/350 

AGC TCT GGT AAG AGC GCT GTT CAA GGA CCA CCT GAG CGT GAC CTC TGT GGA TGC TAC TCT 

SSGKSAVQGPPERDLCGCYS 

1141/370 

10 GTT AGC TCT GTT CTT CCT GGT TGT GCC CAG CCT TGG AAC CAC GGT GAG ACC TTC ACC TGC 
VSSVLPGCAQPWNHGETFTC 
1201/390 

ACT GCT GCC CAC CCA GAG TTG AAG ACC CCA CTT ACC GCC AAC ATC ACC AAG TCC GGA AAC 
TAAHPEL KTPLTANI TKSGN 
15 1261/410 

ACC TTC CGT CCC GAG GTC CAC CTC TTG CCA CCA CCA TCT GAG GAG CTT GCC CTC AAT GAG 

TFRPEVHLLPPPSEELALNE 

1321/430 

CTT GTT ACC CTC ACC TGC CTT GCT CGT GGA TTC AGC CCA AAG GAT GTT CTT GTT AGG TGG 

» 

20 LVTLTCLARGFSPKDVLVRW 
1381/450 

CTT CAG GGA TCT CAG GAG CTT CCA CGT GAG AAG TAC CTC ACT TGG GCT TCC CGT CAG GAG 

LQGSQELPREKYLTWASRQE 

1441/470 

25 CCA AGC CAG GGA ACT ACC ACC TAC GCT GTT ACC AGC ATC CTT CGT GTT GCT GCT GAG GAC 
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PSQGTTTYAVTSI LRVAAED 
1501/490 

TGG AAG AAG GGT GAG ACC TTC TCC TGC ATG GTT GGT CAC GAG GCC CTT CCA CTT GCC TTC 
WKKGETFSCMVGHEALPLAF 
5 1561/510 

ACC CAG AAG ACC ATT GAT CGT TTG GCT GGA AAG CCA ACC CAC ATC AAT GTT TCT GTT GTC 
TQKTIDRLAG KPTHINVSVV 
1621/530 1650/538 
ATG GCT GAG GCT GAT GGA ACC TGC TAC TAA 
10 MAEADGTCY* 
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Figure 11. pGPTV-kan-ocs-ATR-IgA2: 

Bgl II 

1 

CTGGCCGGCGCCAGATCTGGGGAACCTGTGGTTGGCATGCACATACAAATGGACGAACGGATAAACCTTTTC 
5 ACGCCCTT 

81 

TTAAATATCCGATTATTCTAATAAACGCTCTTTTCTCTTAGGTTTACCCGCCAATATATCCTGTCAAACACT 
GATAGTTT 

161 

10 AAACTG AAGG CGGG AAACGAC AATC TG AT C ATG AGCGG AG AATT AAGGG AGTC ACG TT ATG AC CCCG C CGAT 
GACGCGGG 

EcoR I 

241 

AC AAG C CG TTTTACGTTTGGAACTG AC AG AA CC G C AACG TTGAAG G AGCC ACTC AG C CGATC TG AATT C ACT 
15 GCTTTAAT 

321 

G AG AT ATG CGAG AC G CCTATG ATCG C ATG AT ATTTG CTTT C AATTCTGTTGTG C ACGTTGT AAAAAAC CTG A 
GCATGTGT 

401 

20 AGCTCAGATCCTTACCGCCGGTTTCGGTTCATTCTAATGAATATATCACCCGTTACTATCGTATTTTTATGA 
ATAATATT 

481 

CTCCGTTCAATTTACTGATTGTACCCTACTACTTATATGTACAATATTAAAATGAAAACAATATATTGTGCT 
GAATAGGT * 

25 

Sac I Asc I 
561 

TTATAGCGACATCTATGATAGAGCGCCACAATAACAAACAATTGCGTTTTATTATTACAAATCCAATTTTGA 
GCTCGGCG 
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641 

CGCCAGCTGGACATCATGTTGGATATGAAACAACTATTATTTATCTACATGTTTTAGATGTTATCTGATTAT 
TTTTATAC 

721 

5 GTAGTCTTCTATTGATGAGGAGTCTAAGGCTATAGAATTATATATCTAAATGATTAATATATATATTATTAA 
TAATTAAC 

801 

AATAATTAATATATTATAATTTATATATATATATTTTATATTATTATAATAATATTCTTACAAATATAATTA 
TTATATTC 

10 881 

GACGGTATCGGGGCAATTGTATTCGACGGTATCGCGATAAGCTCGCGGATCCCTGAAAGCGACGTTGGATGT 
TAACATCT 

961 

ACAAATTGCCTTTTCTTATCGACCATGTACGTAAGCGCTTACGTTTTTGGTGGACCCTTGAGGAAACTGGTA 
1 5 GCTGTTGT 

1041 

GGGCCTGTGGTCTCAAGATGGATCATTAATTTCCACCTTCACCTACGATGGGGGGCATCGCACCGGTGAGTA 
ATATTGTA 

1121 

20 CGGCTAAGAGCGAATTTGGCCTGTAGGATCCCTGAAAGCGACGTTGGATGTTAACATCTACAAATTGCCTTT 
TCTTATCG 

1201 

ACCATGTACGTAAGCGCTTACGTTTTTGGTGGACCCTTGAGGAAACTGGTAGCTGTTGTGGGCCTGTGGTCT 
CAAGATGG 

25 1281 

ATCATTAATTTCCACCTTCACCTACGATGGGGGGCATCGCACCGGTGAGTAATATTGTACGGCTAAGAGCGA 
ATTTGGCC 

1361 
30 GCGCTTAC 



1441 

G TTTTTGG TGG AC C CTTG AGG AAAC TGG T AG CTG TTG TGGG CC TG TG G TC TC AAG ATGG ATC ATTAATTTCC 
ACCTTCAC 
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1521 

CTACGATGGGGGGCATCGCACCGGTGAGTAATATTGTACGGCTAAGAGCGAATTTGGCCTGTAGGATCCGCG 
AGCTGGTC 

1601 

5 AATC C C ATTG CTTTTG AAG C AG C TCAAC ATTG ATCTCTTTC T CG AT CG AGGG AG ATTTTTCAAATC AG TG CG 
CAAGACGT . 

1681 

GACGTAAGTATCCGAGTCAGTTTTTATTTTTCTACTAATTTGGTCGTTTATTTCGGCGTGTAGGACATGGCA 
ACCGGGCC 

10 1761 

TGAATTTCGCGGGTATTCTGTTTCTATTCCAACTTTTTCTTGATCCGCAGCCATTAACGACTTTTGAATAGA 
TACGCTGA 

1841 

CACGCCAAGCCTCGCTAGTCAAAAGTGTACCAAACAACGCTTTACAGCAAGAACGGAATGCGCGTGACGCTC 
1 5 GCGGTGAC 

1921 

GCCATTTCGCCTTTTCAGAAATGGATAAATAGCCTTGCTTCCTATTATATCTTCCCTTAATTAAGGTACCAC 
TTCTCTCA 

2001 

20 ATCCAACTTTCTAAACAATGGCTTCTAAACCTTTCTTGTCTCTTCTTTCTTTGTCTTTGCTTTTGTTCACCT 
CTACTAGT 

2081 

TTGG CTG AC CTGT AC TTCATTTTGGAC AAATC AGGAAG TG TG CTG C AC C ACTGG AATG AAAT CTATT AC TTT 
GTGGAACA 

25 2161 

GTTGGCTCACAAATTCATCAGCCCACAGTTGAGAATGTCCTTTATTGTTTTCTCCACCCGAGGAACAACCTT 
AATGAAAC 

2241 

TGACAGAAGACAGAGAACAAATCCGTCAAGGCCTAGAAGAACTCCAGAAAGTTCTGCCAGGAGGAGACACTT 
30 ACATGCAT 

2321 

GAAGGATTTGAAAGGGCCAGTGAGCAGATTTATTATGAAAACAGACAAGGGTACAGGACAGCCAGCGTCATC 
ATTGCTTT 
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2401 

GACTGATGGAGAACTCCATGAAGATCTCTTTTTCTATTCAGAGAGGGAGGCTAATAGGTCTCGAGATCTTGG 
TGCAATTG 

2481 

5 TTTACTGTGTTGGTGTGAAAGATTTCAATGAGACACAGCTGGCCCGGATTGCGGACAGTAAGGATCATGTGT 
TTCCCGTG 

2561 

AATGACGGCTTTCAGGCTCTGCAAGGCATCATCCACTCAATTTTGAGCTCTGCTTCCCCAACCAGCCCTAAG 
GTCTTCCC 

10 2641 

TCTCAGCCTTGACAGCACCCCTCAAGATGGTAATGTTGTCGTTGCTTGCCTTGTCCAGGGTTTCTTCCCTCA 
GGAGCCAC 

2721 

TCTCTGTTACCTGGTCTGAATCTGGACAGAATGTTACCGCCAGAAACTTCCCACCTAGCCAGGATGCCTCCG 
15 GTGACCTC 

2801 

TACACCACCAGCTCTCAGCTCACCCTTCCAGCCACCCAGTGCCCAGATGGTAAGTCCGTTACCTGCCATGTT 
AAGCACTA 

2881 

20 CACCAACTCCAGCCAGGATGTTACTGTTCCATGCCGTGTTCCACCACCTCCACCATGCTGCCACCCACGTCT 
CTCTCTTC 

2961 

ACCGTCCTGCCCTTGAGGACTTGCTCTTGGGTTCTGAAGCTAACCTCACCTGCACCCTCACCGGTCTCAGAG 
ATGCCTCT 

25 3041 

GGTGCCACCTTCACCTGGACCCCAAGCTCTGGTAAGAGCGCTGTTCAAGGACCACCTGAGCGTGACCTCTGT 
GGATGCTA 

3121 

Q'pQ'PGTTAGCTCTGTTCTTCCTGGTTGTGCCCAGCCTTGGAACCACGGTGAGACCTTCACCTGCACTGCTGC 
30 CCACCCAG 

3201 

AGTTG AAG AC C C C ACTTAC CG C C AAC ATC AC C AAGTC CGG AAAC AC CTTC CGTCCCG AGG TCC ACCTCTTG C 
CACCACCA 
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3281 

TCTGAGGAGCTTGCCCTCAATGAGCTTGTTACCCTCACCTGCCTTGCTCGTGGATTCAGCCCAAAGGATGTT 
CTTGTTAG 

3361 

5 G TGG CTTC AGGGATCTC AGG AG CTT CC ACG TG AG AAG T AC CTC ACTTGGG CTTC C CG TC AGG AG CC AAG C CA 
GGGAACTA 

3441 

CCACCTACGCTGTTACCAGCATCCTTCGTGTTGCTGCTGAGGACTGGAAGAAGGGTGAGACCTTCTCCTGCA 
TGGTTGGT 

10 3521 

CACGAGGCCCTTCCACTTGCCTTCACCCAGAAGACCATTGATCGTTTGGCTGGAAAGCCAACCCACATCAAT 
GTTTCTGT 

3601 

TGTCATGGCTGAGGCTGATGGAACCTGCTACTAAGATCTGTGAATTCCTGCAGCCCGGGGGATCCACTAGTT 
1 5 CTAGCTAG 

3681 

AGCGGCCGCCACCGCGGTGGCGAATTAACAGAGGTGGATGGACAGACCCGTTCTTACACCGGACTGGGCGCG 
GGATAGGA 

3761 

20 TATTCAGATTGGGATGGGATTGAGCTTAAAGCCGGCGCTGAGACCATGCTCAAGGTAGGCAATGTCCTCAGC 
GTCGAGCC 

3841 

CGGCATCTATGTCGAGGGCATTGGTGGAGCGCGCTTCGGGGATACCGTGCTTGTAACTGAGACCGGATATGA 
GGCCCTCA 

25 3921 

CTCCGCTTGATCTTGGCAAAGATATTTGACGCATTTATTAGTATGTGTTAATTTTCATTTGCAGTGCAGTAT 
TTTCTATT 

4001 

CGATCTTTATGTAATTCGTTACAATTAATAAATATTCAAATCAGATTATTGACTGTCATTTGTATCAAATCG 
30 TGTTTAAT 

4081 

GGATATTTTTATTATAATATTGATGATAATTCACTGGCCGTCGTTTTACAACGTCGTGACTGGGAAAACCCT 
GGCGTTAC 
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4161 

CCAACTTAATCGCCTTGCAGCACATCCCCCTTTCGCCAGCTGGCGCGCCAAGCTTCACGCTGCCGCAAGCAC 
TCAGGGCG 

4241 

5 C AAGGGC TG C T AAAG G AAGCG G AAC ACG T AG AAAG C C AGTC CG C AG AAAC GGTG C TG AC C C CGG ATG AATGT 
CAGCTACT 

4321 

GGCTATCTGGACAAGGGAAAACGCAAGCGCAAAGAGAAAGCAGGTAGCTTGCAGTGGGCTTACATGGCGATA 
GCTAGACT 

10 4401 

GGGCGGTTTTATGGACAGCAAGCGAACCGGAATTGCCAGCTGGGGCGCCCTCTGGTAAGGTTGGGAAGCCCT 
GCAAAGTA 

4481 

AACTGGATGGCTTTCTTGCCGCCT^AGGATCTGATGGCGCAGGGGATCAAGATCATGAGCGGAGAATTAAGGG 
15 AGTCACGT 

4561 

TATGACCCCCGCCGATGACGCGGGACAAGCCGTTTTACGTTTGGAACTGACAGAACCGCAACGTTGAAGGAG 
CCACTCAG 

4641 

20 CCGCGGGTTTCTGGAGTTTAATGAGCTAAGCACATACGTCAGAAACCATTATTGCGCGTTCAAAAGTCGCCT 
AAGGTCAC 

4721 

TATCAGCTAGCAAATATTTCTTGTCAAAAATGCTCCACTGACGTTCCATAAATTCCCCTCGGTATCCAATTA 
GAGTCTCA 

25 4801 

T ATTC ACTCTC AATC CAG ATCTGG ATCGTTTCG C ATG ATTG AAC AAG ATGGATTG C ACG C AGG TTC TC CG G C 
CGCTTGGG 

4881 

TGGAGAGGCTATTCGGCTATGACTGGGCACAACAGACAATCGGCTGCTCTGATGCCGCCGTGTTCCGGCTGT 
30 CAG CG CAG 

4961 

GGGCGCCCGGTTCTTTTTGTCAAGACCGACCTGTCCGGTGCCCTGAATGAACTGCAGGACGAGGCAGCGCGG 
CTATCGTG 



WO 03/064992 



81/91 



PCT/US02/34197 



5041 

GCTGGCCACGACGGGCGTTCCTTGCGCAGCTGTGCTCGACGTTGTCACTGAAGCGGGAAGGGACTGGCTGCT 
ATTGGGCG 

5121 

5 AAGTGCCGGGGCAGGATCTCCTGTCATCTCACCTTGCTCCTGCCGAGAAAGTATCCATCATGGCTGATGCAA 
TGCGGCGG 

5201 

CTG C AT ACG C TTG AT C CGG CT AC CTGC C C ATTCG AC C AC CAAGCGAAACATCGC AT CG AG CGAGCACGT ACT 
CGGATGGA 

10 5281 

AGCCGGTCTTGTCGATCAGGATGATCTGGACGAAGAGCATCAGGGGCTCGCGCCAGCCGAACTGTTCGCCAG 
GCTCAAGG 

5361 

CGCGCATGCCCGACGGCGATGATCTCGTCGTGACCCATGGCGATGCCTGCTTGCCGAATATCATGGTGGAAA 
1 5 ATGGCCGC 

5441 

TTTTCTGGATTCATCGACTGTGGCCGGCTGGGTGTGGCGGACCGCTATCAGGACATAGCGTTGGCTACCCGT 
GATATTGC 

5521 

20 TGAAGAGCTTGGCGGCGAATGGGCTGACCGCTTCCTCGTGCTTTACGGTATCGCCGCTCCCGATTCGCAGCG 
CATCGCCT 

5601 

TCTATCGCCTTCTTGACGAGTTCTTCTGAGCGGGACTCTGAGGATCCCCCGATGAGCTAAGCTAGCTATATC 
ATCAATTT 

25 5681 

ATGTATTACACATAATATCGCACTCAGTCTTTCATCTACGGCAATGTACCAGCTGATATAATCAGTTATTGA 
AATATTTC 

5761 

TGAATTTAAACTTGCATCAATAAATTTATGTTTTTGCTTGGACTATAATACCTGACTTGTTATTTTATCAAT 
30 AAATATTT 

5841 

AAACTATATTTCTTTCAAGATGGGAATTAATTCACTGGCCGTCGTTTTACAACGTCGTGACTGGGAAAACCC 
TGGCGTTA 
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5921 

CCCAACTTAATCGCCTTGCAGCACATCCCCCTTTCGCCAGCTGGCGTAATAGCGAAGAGGCCCGCACCGATC 
GCCCTTCC 

6001 

5 CAACAGTTGCGCAGCCTGAATGGCGCCCGCTCCTTTCGCTTTCTTCCCTTCCTTTCTCGCCACGTTCGCCGG 
CTTTCCCC 

6081 

GTCAAGCTCTAAATCGGGGGCTCCCTTTAGGGTTCCGATTTAGTGCTTTACGGCACCTCGACCCCAAAAAAC 
TTGATTTG 

10 6161 

GGTGATGGTTCACGTAGTGGGCCATCGCCCTGATAGACGGTTTTTCGCCCTTTGACGTTGGAGTCCACGTTC 
TTTAATAG 

6241 

TGGACTCTTGTTCCAAACTGGAACAACACTCAACCCTATCTCGGGCTATTCTTTTGATTTATAAGGGATTTT 
15 GCCGATTT 

6321 

CGGAACCACCATCAAACAGGATTTTCGCCTGCTGGGGCAAACCAGCGTGGACCGCTTGCTGCAACTCTCTCA 
GGGCCAGG 

6401 

20 CGGTGAAGGGCAATCAGCTGTTGCCCGTCTCACTGGTGAAAAGAAAAACCACCCCAGtACATTAAAAACGTC 
CGCAATGT 

6481 

GTTATTAAGTTGTCTAAGCGTCAATTTGTTTACACCACAATATATCCTGCCACCAGCCAGCCAACAGCTCCC 
CGACCGGC 

25 6561 AGCTCGGCACAAAATCACCACTCGATACAGGCAGCCCATCAG 
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Figure 12. pGPTV-hpt-ocs-35SJ/SC 



l 

CTGATGGGCTGCCTGTATCGAGTGGTGATTTTGTGCCGAGCTGCCGGTCGGGGAGCTGTTGGCTGGCTGGTG 
5 GCAGGATA 

81 

TATTGTGGTGTAAACAAATTGACGCTTAGACAACTTAATAACACATTGCGGACGTTTTTAATGTACTGGGGT 
GGTTTTTC 

161 

10 TTTTCACCAGTGAGACGGGCAACAGCTGATTGCCCTTCACCGCCTGGCCCTGAGAGAGTTGCAGCAAGCGGT 
CCACGCTG 

241 

GTTTGCCCCAGCAGGCGAAAATCCTGTTTGATGGTGGTTCCGAAATCGGCAAAATCCCTTATAAATCAAAAG 
AATAGCCC 

15 321 

GAGATAGGGTTGAGTGTTGTTCCAGTTTGGAACAAGAGTCCACTATTAAAGAACGTGGACTCCAACGTCAAA 
GGGCGAAA 

401 

AACCGTCTATCAGGGCGATGGCCCACTACGTGAACCATCACCCAAATCAAGTTTTTTGGGGTCGAGGTGCCG 
20 TAAAGCAC 

481 

TAAATCGGAACCCTAAAGGGAGCCCCCGATTTAGAGCTTGACGGGGAAAGCCGGCGAACGTGGCGAGAAAGG 
AAGGGAAG 

561 i 
25 AAAGCGAAAGGAGCGGGCGCCATTCAGGCTGCGCAACTGTTGGGAAGGGCGATCGGTGCGGGCCTCTTCGCT 
ATTACGCC 

641 

AGCTGGCGAAAGGGGGATGTGCTGCAAGGCGATTAAGTTGGGTAACGCCAGGGTTTTCCCAGTCACGACGTT 
GTAAAACG 

30 721 

ACGGCCAGTGAATT/^ATTCCCATCTTGAAAGAAATATAGTTTAAATATTTATTGATAAAATAACAAGTCAGG 
TATTATAG 
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801 

TCCAAGCAAAAACATAAATTTATTGATGCAAGTTTAAATTCAGAAATATTTCAATAACTGATTATATCAGCT 
GGTACATT 

861 

5 G C CGT AGATG AAAG ACTG AGTGCG ATATT ATG TG TAAT AC AT AAATTG ATG AT AT AG CT AG CTTAG C T CAT C 
GGGGGATC 

961 

CCGGTCGGCATCTACTCTATTCCTTTGCCCTCGGACGAGTGCTGGGGCGTCGGTTTCCACTATCGGCGAGTA 
CTTCTACA 

10 1041 

CAG C CATCGGTC C AGACGGC CG CG CTTCTG CGGG CG ATTTG TGT ACG CC CG AC AGT C CCGG C T CCGG AT CGG 
ACGATTGC 

1121 

GTCGCATCGACCCTGCGCCCAAGCTGCATCATCGAAATTGCCGTCAACCAAGCTCTGATAGAGTTGGTCAAG 
1 5 ACCAATGC 

1201 

GGAGCATATACGCCCGGAGCCGCGGCGATCCTGCAAGCTCCGGATGCCTCCGCTCGAAGTAGCGCGTCTGCT 
GCTCCATA 

1281 

20 CAAGCCAACCACGGCCTCCAGAAGAAGATGTTGGCGACCTCGTATTGGGAATCCCCGAACATCGCCTCGCTC 
CAGTCAAT 

1361 

GACCGCTGTTATGCGGCCATTGTCCGTCAGGACATTGTTGGAGCCGAAATCCGCGTGCACGAGGTGCCGGAC 
TTCGGGGC 

25 1441 

AGTCCTCGGCCCAAAGCATCAGCTCATCGAGAGCCTGCGCGACGGACGCACTGACGGTGTCGTCCATCACAG 
TTTGCCAG 

1521 

TGATACACATGGGGATCAGCAATCGCGCATATGAAATCACGCCATGTAGTGTATTGACCGATTCCTTGCGGT 
30 CCGAATGG 

1601 

GCCGAACCCGCTCGTCTGGCTAAGATCGGCCGCAGCGATCGCATCCATGGCCTCCGCGACCGGCTGCAGAAC 
AGCGGGCA 
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1681 

GTTCGGTTTCAGGCAGGTCTTGCAACGTGACACCCTGTGCACGGCGGGAGATGCAATAGGTCAGGCTCTCGC 
TGAATGCC 

1761 

5 CCAATGTCAAGCACTTCCGGAATCGGGAGCGCGGCCGATGCAAAGTGCCGATAAACATAACGATCTTTGTAG 
AAACCATC 

1841 

GG CG C AG CTATTT ACCCG C AGG AC ATAT C C ACG CC CTCCTAC ATCG AAGCTG AAAG C ACG AG ATT CTTCG CC 
CTCCGAGA 

10 1921 

GCTGCATCAGGTCGGAGACGCTGTCGAACTTTTCGATCAGAAACTTCTCGACAGACGTCGCGGTGAGTTCAG 
GCTTTTTC 

2001 

ATATCTTATTGCCCCCCTAGAGTCGAGATCTGGATTGAGAGTGAATATGAGACTCTAATTGGATACCGAGGG 
15 GAATTTAT 

2081 

GG AACGTC AG TGG AG C ATTTTTG AC AAG AAAT ATTTG CTAG CTG ATAGTG ACC TTAG G C G ACTTTTG AACG C 
GCAATAAT 

2161 

20 GGTTTCTGACGTATGTGCTTAGCTCATTAAACTCCAGAAACCCGCGGCTGAGTGGCTCCTTCAACGTTGCGG 
TTCTGTCA 

2241 

GTTCCAAACGTAAAACGGCTTGTCCCGCGTCATCGGCGGGGGTCATAACGTGACTCCCTTAATTCTCCGCTC 
ATGATCTT 

25 2321 

GATCCCCTGCGCCATCAGATCCTTGGCGGCAAGAAAGCCATCCAGTTTACTTTGCAGGGCTTCCCAACCTTA 
CCAGAGGG 

2401 

CGCCCCAGCTGGCAATTCCGGTTCGCTTGCTGTCCATAAAACCGCCCAGTCTAGCTATCGCCATGTAAGCCC 
30 ACTGCAAG 

2481 

CTACCTGCTTTCTCTTTGCGCTTGCGTTTTCCCTTGTCCAGATAGCCAGTAGCTGACATTCATCCGGGGTCA 
GCACCGTT 
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2561 

TCTGCGGACTGGCTTTCTACGTGTTCCGCTTCCTTTAGCAGCCCTTGCGCCCTGAGTGCTTGCGGCAGCGTG 
AAGCTTGG 

2641 

5 CGCGCCAGCTGGACATCATGTTGGATATGAAACAACTATTATTTATCTACATGTTTTAGATGTTATCTGATT 
ATTTTTAT 

2 721 

ACGTAGTCTTCTATTGATGAGGAGTCTAAGGCTATAGAATTATATATCTAAATGATTAATATATATATTATT 
AATAATTA 

10 2801 

ACAATAATTAATATATTATAATTTATATATATATATTTTATATTATTATAATAATATTCTTACAAATATAAT 
TATTATAT 

2881 

T CG ACGGT ATCGGGG C AATTG ATTC C CG ATC CT ATC TG TC AC TTC ATC AAAAGG AC AG TAG AAAAGG AAGGT 
15 GGCACCTA 

2961 

CAAATGCCATCATTGCGATAAAGGAAAGGCTATCATTCAAGATGCCTCTGCCGACAGTGGTCCCAAAGATGG 
ACCCCCAC 

3041 

20 CCACGAGGAGCATCGTGGAAAAAGAAGACGTTCCAACCACGTCTTCAAAGCAAGTGGATTGATGTGATATCT 
CCACTGAC 

3121 

GTAAGGGATGACGCACAATCCCACTATCCTTCGCAAGACCCTTCCTCTATATAAGGAAGTTCATTTCATTTG 
GAGAGGAC 

25 3201 

ACGCTGAAATCACCAGTCTCTCTCTACAAGGTACCATGGTGCTCTTCGTGCTCACCTGCCTGCTGGCGGTCT 
TCCCAGCC 

3281 

ATCTCCACGAAGAGTCCCATATTTGGTCCCGAGGAGGTGAATAGTGTGGAAGGTAACTCAGTGTCCATCACG 
30 TGCTACTA 

3361 

CCCACCCACCTCTGTCAACCGGCACACCCGGAAGTACTGGTGCCGGCAGGGAGCTAGAGGTGGCTGCATAAC 
CCTCATCT 
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3441 

C CTCGG AGGG CT ACG T C T C C AG C AAAT ATG C AGG CAGG GC TAACCTC AC C AACTTCCCGG AG AACGG C AC AT 
TTGTGGTG 

3521 

5 AACATTGCCCAGCTGAGCCAGGATGACTCCGGGCGCTACAAGTGTGGCCTGGGCATCAATAGCCGAGGCCTG 
TCCTTTGA 

3601 

TGTCAGCCTGGAGGTCAGCCAGGGTCCTGGGCTCCTAAATGACACTAAAGTCTACACAGTGGACCTGGGCAG 
AACGGTGA 

10 3681 

CCATCAACTGCCCTTTCAAGACTGAGAATGCTCAAAAGAGGAAGTCCTTGTACAAGCAGATAGGCCTGTACC 
CTGTGCTG 

3761 

GTCATCGACTCCAGTGGTTATGTGAATCCCAACTATACAGGAAGAATACGCCTTGATATTCAGGGTACTGGC 
1 5 CAGTTACT 

3841 

GTTCAGCGTTGTCATCAACCAACTCAGGCTCAGCGATGCTGGGCAGTATCTCTGCCAGGCTGGGGATGATTC 
CAATAGTA 

3921 

20 ATAAGAAGAATGCTGACCTCCAAGTGCTAAAGCCCGAGCCCGAGCTGGTTTATGAAGACCTGAGGGGCTCAG 
TGACCTTC 

4001 

CACTGTG C CCTGGG CC CTG AGGTGG C AAACG TG GC C AAATTTCTGTG C CG AC AGAG C AG TGG GG AAAAC TG T 
GACGTGGT 

25 4081 

CGTCAACACCCTGGGGAAGAGGGCCCCAGCCTTTGAGGGCAGGATCCTGCTCAACCCCCAGGACAAGGATGG 
CTCATTCA 

4161 

GTGTGGTGATCACAGGCCTGAGGAAGGAGGATGCAGGGCGCTACCTGTGTGGAGCCCATTCGGATGGTCAGC 
30 TGCAGGAA 

4241 

GGCTCGCCTATCCAGGCCTGGCAACTCTTCGTCAATGAGGAGTCCACGATTCCCCGCAGCCCCACTGTGGTG 
AAGGGGGT 
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4321 

GGCAGGAAGCTCTGTGGCCGTGCTCTGCCCCTACAACCGTAAGGAAAGCAAAAGCATCAAGTACTGGTGTCT 
CTGGGAAG 

4401 

5 GGGCCCAGAATGGCCGCTGCCCCCTGCTGGTGGACAGCGAGGGGTGGGTTAAGGCCCAGTACGAGGGCCGCC 
TCTCCCTG 

4481 

CTGGAGGAGCCAGGCAACGGCACCTTCACTGTCATCCTCAACCAGCTCACCAGCCGGGACGCCGGCTTCTAC 
TGGTGTCT 

10 4561 

GACCAACGGCGATACTCTCTGGAGGACCACCGTGGAGATCAAGATTATCGAAGGAGAACCA7UICCTCAAGGT 
TCCCGGGA 

4641 

ATGTCACGGCTGTGCTGGGAGAGACTCTCAAGGTCCCCTGTCACTTTCCATGCAAATTCTCCTCGTACGAGA 
15 AATACTGG 

4721 

TGCAAGTGGAATAACACGGGCTGCCAGGCCCTGCCCAGCCAAGACGAAGGCCCCAGCAAGGCCTTCGTGAAC 
TGTGACGA 

4801 

20 GAACAGCCGGCTTGTCTCCCTGACCCTGAACCTGGTGACCAGGGCTGATGAGGGCTGGTACTGGTGTGGAGT 
GAAGCAGG 

4881 

GCCACTTCTATGGAGAGACTGCAGCCGTCTATGTGGCAGTTGAAGAGAGGAAGGCAGCGGGGTCCCGCGATG 
TCAGCCTA 

25 4961 

GCGAAGGCAGACGCTGCTCCTGATGAGAAGGTGCTAGACTCTGGTTTTCGGGAGATTGAGAACAAAGCCATT 
CAGGATCC 

5041 

CAGGCTTTTTGCAGAGTGAATTCGTTCGTATCATCGGTTTCGACAACGTTCGTCAAGTTCAATGCATCAGTT 
30 TCATTGCG 

5121 

CACACACCAGAATCCTACTGAGTTCGAGTATTATGGCATTGGGAAAACTGTTTTTCTTGTACCATTTGTTGT 
GCTTGTAA 
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5201 

TTTACTGTGTTTTTTATTCGGTTTTCGCTATCGAACTGTGAAATGGAAATGGATGGAGAAGAGTTAATGAAT 
GATATGGT 

5281 

5 CCTTTTGTTCATTCTCAAATTAATATTATTTGTTTTTTCTCTTATTTGTTGTGTGTTGAATTTGAAATTATA 
AGAGATAT 

5361 

GCAAACATTTTGTTTTGAGTAAAAATGTGTCAAATCGTGGCCTCTAATGACCGAAGTTAATATGAGGAGTAA 
AACACTTG 

10 5441 

TAGTTGTCGACGGTATCGATATTAATTCCCGATCCTATCTGTCACTTCATCAAAAGGACAGTAGAAAAGGAA 
GGTGGCAC 

5521 

CTACAAATGCCATCATTGCGATAAAGGAAAGGCTATCATTCAAGATGCCTCTGCCGACAGTGGTCCCAAAGA 
1 5 TGGACCCC 

5601 

CACCCACGAGGAGCATCGTGGAAAAAGAAGACGTTCCAACCACGTCTTCAAAGCAAGTGGATTGATGTGATA 
TCTCCACT 

5681 

20 GACGTAAGGGATGACGCACAATCCCACTATCCTTCGCAAGACCCTTCCTCTATATAAGGAAGTTCATTTCAT 
TTGGAGAG 

5761 

GACACGCTGAAATCACCAGTCTCTCTCTAGAGTACCATGGAGAACCATTTGCTTTTCTGGGGAGTCCTGGCG 
GTTTTTAT 

25 5841 

TAAGGCTGTTCATGTGAAAGCCCAAGAAGATGAAAGGATTGTTCTTGTTGACAACAAATGTAAGTGTGCCCG 
GATTACTT 

5921 

C C AGG ATC ATC CGTTC TTC CG AAG ATC CT AATG AG G AC ATTG TG G AG AG AAACATC CG AATT ATTGTT C CTC 
30 TGAACAAC 

6001 

AGGGAGAATATCTCTGATCCCACCTCACCATTGAGAACCAGATTTGTGTACCATTTGTCTGACCTCTGTAAA 
AAATGTGA 
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6081 

TCCTACAGAAGTGGAGCTGGATAATCAGATAGTTACTGCTACCCAGAGCAATATCTGTGATGAAGACAGTGC 
TACAGAGA 

6161 

5 CCTGCTACACTTATGACAGAAACAAGTGCTACACAGCTGTGGTCCCACTCGTATATGGTGGTGAGACCAAAA 
TGGTGGAA 

6241 

ACAGCCTTAACCCCAGATGCCTGCTATCCTGACTGAATCCGCGGCGATGAGCTAAGCTAGCTATATCATCAA 
TTTATGTA 

10 6321 

TTACACATAATATCGCACTCAGTCTTTCATCTACGGCAATGTACCAGCTGATATAATCAGTTATTGAAATAT 
TTCTGAAT 

6401 

TTAAACTTGCATCAATAAATTTATGTTTTTGCTTGGACTATAATACCTGACTTGTTATTTTATCAATAAATA 
1 5 TTTAAACT 

6481 

ATATTTCTTTCAAGAGCTCAAAATTGGATTTGTAATAATAAAACGCAATTGTTTGTTATTGTGGCGCTCTAT 
CATAGATG 

6561 

20 TCGCTATAAACCTATTCAGCACAATATATTGTTTTCATTTTAATATTGTACATATAAGTAGTAGGGTACAAT 
CAGTAAAT 

6641 

TGAACGGAGAATATTATTCATAAAAATACGATAGTAACGGGTGATATATTCATTAGAATGAACCGAAACCGG 
CGGTAAGG 

25 6721 

ATCTGAG CTAC AC ATG CTC AGG TTTTTT AC AACGTG C AC AAC AG AATTG AAAG C AAAT AT C ATG CG ATC AT A 
GGCGTCTC 

6801 

GCATATCTCATTAAAGCAGTGAATTCAGATCGGCTGAGTGGCTCCTTCAACGTTGCGGTTCTGTCAGTTCCA 
30 AACGTAAA 

6881 

ACGGCTTGTCCCGCGTCATCGGCGGGGTCATAACGTGACTCCCTTAATTCTCCGCTCATGATCAGATTGTCG 
TTTCCCGC 
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6961 

CTTCAGTTTAAACTATCAGTGTTTGACAGGATATATTGGCGGGTAAACCTAAGAGAAAAGAGCGTTTATTAG 
AATAATCG 

7041 

5 GATATTTAAAAGGGCGTGAAAAGGTTTATCCGTTCGTCCATTTGTATGTGCATGCCAACCACAGGTTCCCCA 
GATCTGGC 



7121 GCCGGCCAG 
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<170> Patentln Ver. 2.1 

<210> 1 

<211> 1596 

<212> DNA 

<213> Homo sapiens 

<400> 1 

atggctccca gcagcccccg gcccgcgctg cccgcactcc tggtcctgct cggggctctg 60 
ttcccaggac ctggcaatgc ccagacatct gtgtccccct caaaagtcat cctgccccgg 120 
ggaggctccg tgctggtgac atgcagcacc tcctgtgacc agcccaagtt gttgggcata 180 
gagaccccgt tgcctaaaaa ggagttgctc ctgcctggga acaaccggaa ggtgtatgaa 24 0 
ctgagcaatg tgcaagaaga tagccaacca atgtgctatt caaactgccc tgatgggcag 3 00 
tcaacagcta aaaccttcct caccgtgtac tggactccag aacgggtgga actggcaccc 3 60 
ctcccctctt ggcagccagt gggcaagaac cttaccctac gctgccaggt ggagggtggg 42 0 
gcaccccggg ccaacctcac cgtggtgctg ctccgtgggg agaaggagct gaaacgggag 480 
ccagctgtgg gggagcccgc tgaggtcacg accacggtgc tggtgaggag agatcaccat 54 0 
ggagccaatt tctcgtgccg cactgaactg gacctgcggc cccaagggct ggagctgttt 600 
gagaacacct cggcccccta ccagctccag acctttgtcc tgccagcgac tcccccacaa 660 
cttgtcagcc cccgggtcct agaggtggac acgcagggga ccgtggtctg ttccctggac 72 0 
gggctgttcc cagtctcgga ggcccaggtc cacctggcac tgggggacca gaggttgaac 780 
cccacagtca cctatggcaa cgactccttc tcggccaagg cctcagtcag tgtgaccgca 840 
gaggacgagg gcacccagcg gctgacgtgt gcagtaatac tggggaacca gagccaggag 900 
acactgcaga cagtgaccat ctacagcttt ccggcgccca acgtgattct gacgaagcca 960 
gaggtctcag aagggaccga ggtgacagtg aagtgtgagg cccaccctag agccaaggtg 102 0 
acgctgaatg gggttccagc ccagccactg ggcccgaggg cccagctcct gctgaaggcc 1080 
accccagagg acaacgggcg cagcttctcc tgctctgcaa ccctggaggt ggccggccag 1140 
cttatacaca agaaccagac ccgggagctt cgtgtcctgt atggcccccg actggacgag 1200 
agggattgtc cgggaaactg gacgtggcca gaaaattccc agcagactcc aatgtgccag 1260 
gcttggggga acccattgcc cgagctcaag tgtctaaagg atggcacttt cccactgccc 1320 
atcggggaat cagtgactgt cactcgagat cttgagggca cctacctctg tcgggccagg 13 80 
agcactcaag gggaggtcac ccgcaaggtg accgtgaatg tgctctcccc ccggtatgag 1440 
attgtcatca tcactgtggt agcagccgca gtcataatgg gcactgcagg cctcagcacg 1500 
tacctctata accgccagcg gaagatcaag aaatacagac tacaacaggc ccaaaaaggg 1560 
acccccatga aaccgaacac acaagccacg cctccc 1596 



<210> 2 
<211> 532 
<212> PRT 
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<213> Homo sapiens 
<400> 2 

Met Ala Pro Ser Ser Pro Arg Pro Ala Leu Pro Ala Leu Leu Val Leu 
15 10 15 

Leu Gly Ala Leu Phe Pro Gly Pro Gly Asn Ala Gin Thr Ser Val Ser 
20 25 30 

Pro Ser Lys Val lie Leu Pro Arg Gly Gly Ser Val Leu Val Thr Cys 
35 40 45 

Ser Thr Ser Cys Asp Gin Pro Lys Leu Leu Gly lie Glu Thr Pro Leu 
50 55 60 

Pro Lys Lys Glu Leu Leu Leu Pro Gly Asn Asn Arg Lys Val Tyr Glu 
65 70 75 80 

Leu Ser Asn Val Gin Glu Asp Ser Gin Pro Met Cys Tyr Ser Asn Cys 
85 90 95 

Pro Asp Gly Gin Ser Thr Ala Lys Thr Phe Leu Thr Val Tyr Trp Thr 
100 105 110 

Pro Glu Arg Val Glu Leu Ala Pro Leu Pro Ser Trp Gin Pro Val Gly 
115 120 125 

Lys Asn Leu Thr Leu Arg Cys Gin Val Glu Gly Gly Ala Pro Arg Ala 
130 135 140 

Asn Leu Thr Val Val Leu Leu Arg Gly Glu Lys Glu Leu Lys Arg Glu 
145 150 155 160 

Pro Ala Val Gly Glu Pro Ala Glu Val Thr Thr Thr Val Leu Val Arg 
165 170 175 

Arg Asp His His Gly Ala Asn Phe Ser Cys Arg Thr Glu Leu Asp Leu 
180 185 190 

Arg Pro Gin Gly Leu Glu Leu Phe Glu Asn Thr Ser Ala Pro Tyr Gin 
195 200 205 

Leu Gin Thr Phe Val Leu Pro Ala Thr Pro Pro Gin Leu Val Ser Pro 
210 215 220 

Arg Val Leu Glu Val Asp Thr Gin Gly Thr Val Val Cys Ser Leu Asp 
225 230 235 240 

Gly Leu Phe Pro Val Ser Glu Ala Gin Val His Leu Ala Leu Gly Asp 
245 250 255 

Gin Arg Leu Asn Pro Thr Val Thr Tyr Gly Asn Asp Ser Phe Ser Ala 
260 265 270 

Lys Ala Ser Val Ser Val Thr Ala Glu Asp Glu Gly Thr Gin Arg Leu 
275 280 285 

Thr Cys Ala Val lie Leu Gly Asn Gin Ser Gin Glu Thr Leu Gin Thr 
290 295 300 



Val Thr lie Tyr Ser Phe Pro Ala Pro Asn Val lie Leu Thr Lys Pro 
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Glu Val Ser Glu Gly Thr Glu Val Thr Val Lys Cys Glu Ala His Pro 
325 330 335 

Arg Ala Lys Val Thr Leu Asn Gly Val Pro Ala Gin Pro Leu Gly Pro 
340 345 350 

Arg Ala Gin Leu Leu Leu Lys Ala Thr Pro Glu Asp Asn Gly Arg Ser 
355 360 365 

Phe Ser Cys Ser Ala Thr Leu Glu Val Ala Gly Gin Leu lie His Lys 
370 375 380 

Asn Gin Thr Arg Glu Leu Arg Val Leu Tyr Gly Pro Arg Leu Asp Glu 
385 390 395 400 

Arg Asp Cys Pro Gly Asn Trp Thr Trp Pro Glu Asn Ser Gin Gin Thr 
405 410 415 

Pro Met Cys Gin Ala Trp Gly Asn Pro Leu Pro Glu Leu Lys Cys Leu 
420 425 430 

Lys Asp Gly Thr Phe Pro Leu Pro lie Gly Glu Ser Val Thr Val Thr 
435 440 445 

Arg Asp Leu Glu Gly Thr Tyr Leu Cys Arg Ala Arg Ser Thr Gin Gly 
450 455 460 

Glu Val Thr Arg Lys Val Thr Val Asn Val Leu Ser Pro Arg Tyr Glu 
465 470 475 480 

lie Val lie lie Thr Val Val Ala Ala Ala Val lie Met Gly Thr Ala 
485 490 495 

Gly Leu Ser Thr Tyr Leu Tyr Asn Arg Gin Arg Lys lie Lys Lys Tyr 
500 505 510 

Arg Leu Gin Gin Ala Gin Lys Gly Thr Pro Met Lys Pro Asn Thr Gin 
515 520 525 



Ala Thr Pro Pro 
530 



<210> 3 

<211> 3003 

<212> DNA 

<213> Homo sapiens 



<400> 3 

gctataagga tcacgcgccc cagtcgacgc tgagctcctc tgctactcag agttgcaacc 60 
tcagcctcgc tatggctccc agcagccccc ggcccgcgct gcccgcactc ctggtcctgc 12 0 
tcggggctct gttcccagga cctggcaatg cccagacatc tgtgtccccc tcaaaagtca 180 
tcctgccccg gggaggctcc gtgctggtga catgcagcac ctcctgtgac cagcccaagt 24 0 
tgttgggcat agagaccccg ttgcctaaaa aggagttgct cctgcctggg aacaaccgga 300 
aggtgtatga actgagcaat gtgcaagaag atagccaacc aatgtgctat tcaaactgcc 360 
ctgatgggca gtcaacagct aaaaccttcc tcaccgtgta ctggactcca gaacgggtgg 42 0 
aactggcacc cctcccctct tggcagccag tgggcaagaa ccttacccta cgctgccagg 480 
tggagggtgg ggcaccccgg gccaacctca ccgtggtgct gctccgtggg gagaaggagc 54 0 
tgaaacggga gccagctgtg ggggagcccg ctgaggtcac gaccacggtg ctggtgagga 600 
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gagatcacca 
tggagctgtt 
ctcccccaca 
gttccctgga 
agaggttgaa 
gtgtgaccgc 
agagccagga 
tgacgaagcc 
gagccaaggt 
tgctgaaggc 
tggccggcca 
gactggacga 
caatgtgcca 
tcccactgcc 
gtcgggccag 
cccggtatga 
gcctcagcac 
cccaaaaagg 
ggacagggcc 
tggaagacat 
tgtcctcagt 
gccacgcatc 
atgattgatg 
cccaccatga 
ggccccacag 
aggcccacac 
gatgatatgt 
ggctaaatga 
caccaggtac 
ggggctggga 
gcacaaaagc 
cttattcctc 
atttctgcca 
atatgcccaa 
attgcagctc 
ggtattggag 
atgtgtagac 
cactgcagtc 
gggaccatag 
acggggtctc 
gcc 



tggagccaat 
tgagaacacc 
acttgtcagc 
cgggctgttc 
ccccacagtc 
agaggacgag 
gacactgcag 
agaggtctca 
gacgctgaat 
caccccagag 
gcttatacac 
gagggattgt 
ggcttggggg 
catcggggaa 
gagcactcaa 
gattgtcatc 
gtacctctat 
gacccccatg 
tcttcctcgg 
atgccatgca 
cagatacaac 
tgatctgtag 
gatgttaaag 
ggacatacaa 
acttacagaa 
ttcctgacgg 
atttattcat 
acataggtct 
agttgtacag 
cttctcattg 
actatatgga 
ccttcccccc 
gtgttcacaa 
gctatgcctt 
cagtttcctg 
gactccctcc 
aagctctcgc 
ttgacctttt 
gctcacaaca 
gcaacattgc 



ttctcgtgcc 
tcggccccct 
ccccgggtcc 
ccagtctcgg 
acctatggca 
ggcacccagc 
acagtgacca 
gaagggaccg 
ggggttccag 
gacaacgggc 
aagaaccaga 
ccgggaaact 
aacccattgc 
tcagtgactg 
ggggaggtca 
atcactgtgg 
aaccgccagc 
aaaccgaaca 
ccttcccata 
gctacaccta 
agcatttggg 
tcacatgact 
tctagcctga 
ctgggaaata 
gaagtggccc 
atgccagctt 
ttgttatttt 
ctggcctcac 
gttgtacact 
gccaacctgc 
ctggtaatgg 
aaaactgaca 
tgacactcag 
gtcctcttgt 
cagtgatcag 
cagctttgga 
tctgtcaccc 
gggctcaagt 
ccacacctgg 
ccagacttcc 
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gcactgaact 
accagctcca 
tagaggtgga 
aggcccaggt 
acgactcctt 
ggctgacgtg 
tctacagctt 
aggtgacagt 
cccagccact 
gcagcttctc 
cccgggagct 
ggacgtggcc 
ccgagctcaa 
tcactcgaga 
cccgcaaggt 
tagcagccgc 
ggaagatcaa 
cacaagccac 
ttggtggcag 
ccggccctgg 
gccatggtac 
aagccaagag 
tgagagggga 
ctgaaacttg 
tccatagaca 
gggcactgct 
accagctatt 
ggagctccca 
gcaggagagt 
ctttccccag 
ttcacaggtt 
cctttgttag 
cggtcatgtc 
cctgtttgca 
ggtcctgcaa 
agcctcatcc 
aggctggagt 
gatcctccca 
caaatttgat 
tttgtgttag 



ggacctgcgg 
gacctttgtc 
cacgcagggg 
ccacctggca 
ctcggccaag 
tgcagtaata 
tccggcgccc 
gaagtgtgag 
gggcccgagg 
ctgctctgca 
tcgtgtcctg 
agaaaattcc 
gtgtctaaag 
tcttgagggc 
gaccgtgaat 
agtcataatg 
gaaatacaga 
gcctccctga 
tggtgccaca 
gacgccggag 
ctgcacacct 
gaaggagcaa 
agtggtgggg 
ctgcctattg 
tgtgtagcat 
gtctactgac 
tattgagtgt 
gtccatgtca 
gcctggcaaa 
aaggagtgat 
cagagattac 
ccacctcccc 
tggacatgag 
tttcactggg 
gcagtgggga 
gcgtgtgtgt 
gcagtggtgc 
cctcagcctc 
tttttttttt 
ttaataaagc 



ccccaagggc 
ctgccagcga 
accgtggtct 
ctgggggacc 
gcctcagtca 
ctggggaacc 
aacgtgattc 
gcccacccta 
gcccagctcc 
accctggagg 
tatggccccc 
cagcagactc 
gatggcactt 
acctacctct 
gtgctctccc 
ggcactgcag 
ctacaacagg 
acctatcccg 
ctgaacagag 
gacagggcat 
aaaacactag 
gactcaagac 
gagacatagc 
ggtatgctga 
caaaacacaa 
cccaaccctt 
cttttatgta 
cattcaaggt 
aagatcaaat 
ttttctatcg 
ccagtgaggc 
acccacatac 
tgcccaggga 
agcttgcact 
agggggccaa 
gtgtgtgtgt 
aatcatggtt 
ctgagtagct 
tttttcagag 
tttctcaact 



660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3003 



<210> 4 
<211> 6 
<212> PRT 

<213> Homo sapiens 
<400> 4 

Ser Glu Lys Asp Glu Leu 
1 5 



<210> 5 
<211> 7 
<212> PRT 

<213> Homo sapiens 



<400> 5 

Arg Ser Glu Lys Asp Glu Leu 
1 5 
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<210> 6 
<211> 52 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Cloning primer 
<400> 6 

tctgttccca ggaactagtt tggcacagac atctgtgtcc ccctcaaaag tc 52 



<210> 7 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Cloning primer 
<400> 7 

cataccgggg actagtcaca ttcacggtca cctcgcgg 38 



<210> 8 

<211> 799 

<212> PRT 

<213> Homo sapiens 

<400> 8 

Gin Thr Ser Val Ser Pro Ser Lys Val lie Leu Pro Arg Gly Gly Ser 
15 10 15 

Val Leu Val Thr Cys Ser Thr Ser Cys Asp Gin Pro Lys Leu Leu Gly 
20 25 30 

lie Glu Thr Pro Leu Pro Lys Lys Glu Leu Leu Leu Pro Gly Asn Asn 
35 40 45 

Arg Lys Val Tyr Glu Leu Ser Asn Val Gin Glu Asp Ser Gin Pro Met 
50 55 60 

Cys Tyr Ser Asn Cys Pro Asp Gly Gin Ser Thr Ala Lys Thr Phe Leu 
65 70 75 80 

Thr Val Tyr Trp Thr Pro Glu Arg Val Glu Leu Ala Pro Leu Pro Ser 
85 90 95 

Trp Gin Pro Val Gly Lys Asn Leu Thr Leu Arg Cys Gin Val Glu Gly 
100 105 110 

Gly Ala Pro Arg Ala Asn Leu Thr Val Val Leu Leu Arg Gly Glu Lys 
115 120 125 

Glu Leu Lys Arg Glu Pro Ala Val Gly Glu Pro Ala Glu Val Thr Thr 
130 135 140 



Thr Val Leu Val Arg Arg Asp His His Gly Ala Asn Phe Ser Cys Arg 
145 150 155 160 
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Thr Glu Leu Asp 



Ser Ala Pro Tyr 
180 

Gin Leu Val Ser 
195 

Val Cys Ser Leu 
210 

Leu Ala Leu Gly 
225 

Asp Ser Phe Ser 



Gly Thr Gin Arg 
260 

Glu Thr Leu Gin 
275 

lie Leu Thr Lys 
290 

Cys Glu Ala His 
305 

Gin Pro Leu Gly 



Asp Asn Gly Arg 
340 

Gin Leu lie His 
355 

Pro Arg Leu Asp 
370 

Asn Ser Gin Gin 
385 

Glu Leu Lys Cys 



Ser Val Thr Val 
420 

Arg Ser Thr Gin 
435 

Ser Gly Ser Ser 
450 

Leu Asp Ser Thr 
465 



Leu Arg Pro Gin 
165 

Gin Leu Gin Thr 



Pro Arg Val Leu 
200 

Asp Gly Leu Phe 
215 

Asp Gin Arg Leu 
230 

Ala Lys Ala Ser 
245 

Leu Thr Cys Ala 



Thr Val Thr lie 
280 

Pro Glu Val Ser 
295 

Pro Arg Ala Lys 
310 

Pro Arg Ala Gin 
325 

Ser Phe Ser Cys 



Lys Asn Gin Thr 
360 

Glu Arg Asp Cys 
375 

Thr Pro Met Cys 
390 

Leu Lys Asp Gly 
405 

Thr Arg Asp Leu 



Gly Glu Val Thr 
440 

Ala Ser Pro Thr 
455 

Pro Gin Asp Gly 
470 



Gly Leu Glu Leu 
170 

Phe Val Leu Pro 
185 

Glu Val Asp Thr 



Pro Val Ser Glu 
220 

Asn Pro Thr Val 
235 

Val Ser Val Thr 
250 

Val lie Leu Gly 
265 

Tyr Ser Phe Pro 



Glu Gly Thr Glu 
300 

Val Thr Leu Asn 
315 

Leu Leu Leu Lys 
330 

Ser Ala Thr Leu 
345 

Arg Glu Leu Arg 



Pro Gly Asn Trp 
380 

Gin Ala Trp Gly 
395 

Thr Phe Pro Leu 
410 

Glu Gly Thr Tyr 
425 

Arg Glu Val Thr 



Ser Pro Lys Val 
460 

Asn Val Val Val 
475 



Phe Glu Asn Thr 
175 

Ala Thr Pro Pro 
190 

Gin Gly Thr Val 
205 

Ala Gin Val His 



Thr Tyr Gly Asn 
240 

Ala Glu Asp Glu 
255 

Asn Gin Ser Gin 
270 

Ala Pro Asn Val 
285 

Val Thr Val Lys 



Gly Val Pro Ala 
320 

Ala Thr Pro Glu 
335 

Glu Val Ala Gly 
350 

Val Leu Tyr Gly 
365 

Thr Trp Pro Glu 



Asn Pro Leu Pro 
400 

Pro lie Gly Glu 
415 

Leu Cys Arg Ala 
430 

Val Asn Val Thr 
445 

Phe Pro Leu Ser 



Ala Cys Leu Val 
480 
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Gin Gly Phe Phe Pro Gin Glu Pro Leu Ser Val Thr Trp Ser Glu Ser 
485 490 495 

Gly Gin Asn Val Thr Ala Arg Asn Phe Pro Pro Ser Gin Asp Ala Ser 
500 505 510 

Gly Asp Leu Tyr Thr Thr Ser Ser Gin Leu Thr Leu Pro Ala Thr Gin 
515 520 525 

Cys Pro Asp Gly Lys Ser Val Thr Cys His Val Lys His Tyr Thr Asn 
530 535 540 

Ser Ser Gin Asp Val Thr Val Pro Cys Arg Val Pro Pro Pro Pro Pro 
545 550 555 560 

Cys Cys His Pro Arg Leu Ser Leu His Arg Pro Ala Leu Glu Asp Leu 
565 570 575 

Leu Leu Gly Ser Glu Ala Asn Leu Thr Cys Thr Leu Thr Gly Leu Arg 
580 585 590 

Asp Ala Ser Gly Ala Thr Phe Thr Trp Thr Pro Ser Ser Gly Lys Ser 
595 600 605 

Ala Val Gin Gly Pro Pro Glu Arg Asp Leu Cys Gly Cys Tyr Ser Val 
610 615 620 

Ser Arg Val Leu Pro Gly Cys Ala Gin Pro Trp Asn His Gly Glu Thr 
625 630 635 640 

Phe Thr Cys Thr Ala Ala His Pro Glu Leu Lys Thr Pro Leu Thr Ala 
645 650 655 

Asn lie Thr Lys Ser Gly Asn Thr Phe Arg Pro Glu Val His Leu Leu 
660 665 670 

Pro Pro Pro Ser Glu Glu Leu Ala Leu Asn Glu Leu Val Thr Leu Thr 
675 680 685 

Cys Leu Ala Arg Gly Phe Ser Pro Lys Asp Val Leu Val Arg Trp Leu 
690 695 700 

Gin Gly Ser Gin Glu Leu Pro Arg Glu Lys Tyr Leu Thr Trp Ala Ser 
705 710 715 720 

Arg Gin Glu Pro Ser Gin Gly Thr Thr Thr Tyr Ala Val Thr Ser lie 
725 730 735 

Leu Arg Val Ala Ala Glu Asp Trp Lys Lys Gly Glu Thr Phe Ser Cys 
740 745 750 

Met Val Gly His Glu Ala Leu Pro Leu Ala Phe Thr Gin Lys Thr lie 
755 760 765 

Asp Arg Leu Ala Gly Lys Pro Thr His lie Asn Val Ser Val Val Met 
770 775 780 



Ala Glu Ala Asp Gly Thr Cys Tyr Arg Ser Glu Lys Asp Glu Leu 
785 790 795 
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<210> 9 
<211> 6313 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Expression-type plasmid pSSPICAMHuA2 

<400> 9 

gaactcgagc agctgaagct tgcatgcctg caggtcgacg gtatcgataa ggatccctga 60 
aagcgacgtt ggatgttaac atctacaaat tgccttttct tatcgaccat gtacgtaagc 12 0 
gcttacgttt ttggtggacc cttgaggaaa ctggtagctg ttgtgggcct gtggtctcaa 180 
gatggatcat taatttccac cttcacctac gatggggggc atcgcaccgg tgagtaatat 24 0 
tgtacggcta agagcgaatt tggcctgtag gatccctgaa agcgacgttg gatgttaaca 300 
tctacaaatt gccttttctt atcgaccatg tacgtaagcg cttacgtttt tggtggaccc 360 
ttgaggaaac tggtagctgt tgtgggcctg tggtctcaag atggatcatt aatttccacc 42 0 
ttcacctacg atggggggca tcgcaccggt gagtaatatt gtacggctaa gagcgaattt 480 
ggcctgtagg atccctgaaa gcgacgttgg atgttaacat ctacaaattg ccttttctta 540 
tcgaccatgt acgtaagcgc ttacgttttt ggtggaccct tgaggaaact ggtagctgtt 600 
gtgggcctgt ggtctcaaga tggatcatta atttccacct tcacctacga tggggggcat 660 
cgcaccggtg agtaatattg tacggctaag agcgaatttg gcctgtagga tccgcgagct 720 
ggtcaatccc attgcttttg aagcagctca acattgatct ctttctcgag ggagattttt 780 
caaatcagtg cgcaagacgt gacgtaagta tccgagtcag tttttatttt tctactaatt 840 
tggtcgttta tttcggcgtg taggacatgg caaccgggcc tgaatttcgc gggtattctg 900 
tttctattcc aactttttct tgatccgcag ccattaacga cttttgaata gatacgctga 960 
cacgccaagc ctcgctagtc aaaagtgtac caaacaacgc tttacagcaa gaacggaatg 102 0 
cgcgtgacgc tcgcggtgac gccatttcgc cttttcagaa atggataaat agccttgctt 1080 
cctattatat cttcccaaat taccaataca ttacactagc atctgaattt cataaccaat 1140 
ctcgatacac caaatcgact ctagaggatc tatcgattcc cgggtaccat gggatctaaa 12 00 
ccttttttgt ctcttctttc attgtcattg cttttgttta catctactag tttggcacag 1260 
acatctgtgt ccccctcaaa agtcatcctg ccccggggag gctccgtgct ggtgacatgc 1320 
agcacctcct gtgaccagcc caagttgttg ggcatagaga ccccgttgcc taaaaaggag 13 80 
ttgctcctgc ctgggaacaa ccggaaggtg tatgaactga gcaatgtgca agaagatagc 144 0 
caaccaatgt gctattcaaa ctgccctgat gggcagtcaa cagctaaaac cttcctcacc 1500 
gtgtactgga ctccagaacg ggtggaactg gcacccctcc cctcttggca gccagtgggc 1560 
aagaacctta ccctacgctg ccaggtggag ggtggggcac cccgggccaa cctcaccgtg 162 0 
gtgctgctcc gtggggagaa ggagctgaaa cgggagccag ctgtggggga gcccgctgag 168 0 
gtcacgacca cggtgctggt gaggagagat caccatggag ccaatttctc gtgccgcact 174 0 
gaactggacc tgcggcccca agggctggag ctgtttgaga acacctcggc cccctaccag 1800 
ctccagacct ttgtcctgcc agcgactccc ccacaacttg tcagcccccg ggtcctagag 1860 
gtggacacgc aggggaccgt ggtctgttcc ctggacgggc tgttcccagt ctcggaggcc 192 0 
caggtccacc tggcactggg ggaccagagg ttgaacccca cagtcaccta tggcaacgac 1980 
tccttctcgg ccaaggcctc agtcagtgtg accgcagagg acgagggcac ccagcggctg 2 04 0 
acgtgtgcag taatactggg gaaccagagc caggagacac tgcagacagt gaccatctac 2100 
agctttccgg cgcccaacgt gattctgacg aagccagagg tctcagaagg gaccgaggtg 2160 
acagtgaagt gtgaggccca ccctagagcc aaggtgacgc tgaatggggt tccagcccag 222 0 
ccactgggcc cgagggccca gctcctgctg aaggccaccc cagaggacaa cgggcgcagc 22 80 
ttctcctgct ctgcaaccct ggaggtggcc ggccagctta tacacaagaa ccagacccgg 2340 
gagcttcgtg tcctgtatgg cccccgactg gacgagaggg attgtccggg aaactggacg 24 00 
tggccagaaa attcccagca gactccaatg tgccaggctt gggggaaccc attgcccgag 2460 
ctcaagtgtc taaaggatgg cactttccca ctgcccatcg gggaatcagt gactgtcact 2520 
cgagatcttg agggcaccta cctctgtcgg gccaggagca ctcaagggga ggtcacccgc 2580 
gaggtgaccg tgaatgtgac tagtgggagc tcagcatccc cgaccagccc caaggtcttc 2 640 
ccgctgagcc tcgacagcac cccccaagat gggaacgtgg tcgtcgcatg cctggtccag 2700 
ggcttcttcc cccaggagcc actcagtgtg acctggagcg aaagcggaca gaacgtgacc 2760 
gccagaaact tcccacctag ccaggatgcc tccggggacc tgtacaccac gagcagccag 2 82 0 
ctgaccctgc cggccacaca gtgcccagac ggcaagtccg tgacatgcca cgtgaagcac 2880 
tacacgaatt ccagccagga tgtgactgtg ccctgccgag ttcccccacc tcccccatgc 2940 
tgccaccccc gactgtcgct gcaccgaccg gccctcgagg acctgctctt aggttcagaa 3000 
gcgaacctca cgtgcacact gaccggcctg agagatgcct ctggtgccac cttcacctgg 3060 
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acgccctcaa 
tacagcgtgt 
acctgcactg 
ggaaacacat 
aacgagctgg 
cgctggctgc 
caggagccca 
gaggactgga 
gccttcacac 
gttgtcatgg 
tcgatatcaa 
atcctgttgc 
taataattaa 
cgcaattata 
tatcgcgcgc 
gtcgtattaa 
agaggcggtt 
gtcgttcggc 
gaatcagggg 
cgtaaaaagg 
aaaaatcgac 
tttccccctg 
ctgtccgcct 
ctcagttcgg 
cccgaccgct 
ttatcgccac 
gctacagagt 
atctgcgctc 
aaacaaacca 
aaaaaaggat 
gaaaactcac 
cttttaaatt 
gacagttacc 
tccatagttg 
ggccccagtg 
ataaaccagc 
atccagtcta 
cgcaacgttg 
tcattcagct 
aaagcggtta 
tcactcatgg 
ttttctgtga 
agttgctctt 
gtgctcatca 
agatccagtt 
accagcgttt 
gcgacacgga 
cagggttatt 
ggggttccgc 
atgacattaa 
gatgacggtg 
gcggatgccg 
ggctggctta 
ttgtcgttag 

aggtgacact 



gtgggaagag 
ccagagtact 
ctgcccaccc 
tccggcccga 
tgacgctgac 
aggggtcaca 
gccagggcac 
agaaggggga 
agaagaccat 
cggaggcgga 
gctaattccc 
cggtcttgcg 
catgtaatgc 
catttaatac 
ggtgtcatct 
tttcgataag 
tgcgtattgg 
tgcggcgagc 
ataacgcagg 
ccgcgttgct 
gctcaagtca 
gaagctccct 
ttctcccttc 
tgtaggtcgt 
gcgccttatc 
tggcagcagc 
tcttgaagtg 
tgctgaagcc 
ccgctggtag 
ctcaagaaga 
gttaagggat 
aaaaatgaag 
aatgcttaat 
cctgactccc 
ctgcaatgat 
cagccggaag 
ttaattgttg 
ttgccattgc 
ccggttccca 
gctccttcgg 
ttatggcagc 
ctggtgagta 
gcccggcgtc 
ttggaaaacg 
cgatgtaacc 
ctgggtgagc 
aatgttgaat 
gtctcatgag 
gcacatttcc 
cctataaaaa 
aaaacctctg 
ggagcagaca 
actatgcggc 
aacgcggcta 
ata 



cgctgttcaa 
tcctggctgt 
cgagttgaag 
ggtccacctg 
gtgcctggca 
ggagctgccc 
caccacctat 
gaccttctcc 
cgaccgcttg 
cggcacctgc 
gatcgttcaa 
atgattatca 
atgacgttat 
gcgatagaaa 
atgttactag 
ccaggttaac 
gcgctcttcc 
ggtatcagct 
aaagaacatg 
ggcgtttttc 
gaggtggcga 
cgtgcgctct 
gggaagcgtg 
tcgctccaag 
cggtaactat 
cactggtaac 
gtggcctaac 
agttaccttc 
cggtggtttt 
tcctttgatc 
tttggtcatg 
ttttaaatca 
cagtgaggca 
cgtcgtgtag 
accgcgagac 
ggccgagcgc 
ccgggaagct 
tacaggcatc 
acgatcaagg 
tcctccgatc 
actgcataat 
ctcaaccaag 
aatacgggat 
ttcttcgggg 
cactcgtgca 
aaaaacagga 
actcatactc 
cggatacata 
ccgaaaagtg 
taggcgtatc 
acacatgcag 
agcccgtcag 
atcagagcag 
caattaatac 
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ggaccacctg 
gcccagccat 
accccactaa 
ctgccgccgc 
cgtggcttca 
cgcgagaagt 
gctgtgacca 
tgcatggtgg 
gcgggtaaac 
tacagatctg 
acatttggca 
tataatttct 
ttatgagatg 
acaaaatata 
atcggggatc 
ctgcattaat 
gcttcctcgc 
cactcaaagg 
tgagcaaaag 
cataggctcc 
aacccgacag 
cctgttccga 
gcgctttctc 
ctgggctgtg 
cgtcttgagt 
aggattagca 
tacggctaca 
ggaaaaagag 
tttgtttgca 
ttttctacgg 
agattatcaa 
atctaaagta 
cctatctcag 
ataactacga 
ccacgctcac 
agaagtggtc 
agagtaagta 
gtggtgtcac 
cgagttacat 
gttgtcagaa 
tctcttactg 
tcattctgag 
aataccgcgc 
cgaaaactct 
cccaactgat 
aggcaaaatg 
ttcctttttc 
tttgaatgta 
ccacctgacg 
acgaggccct 
ctcccggaga 
ggcgcgtcag 
attgtactga 
ataaccttat 



agcgtgacct 
ggaaccatgg 
ccgccaacat 
cgtcggagga 
gccccaagga 
acctgacttg 
gcatactgcg 
gccacgaggc 
ccacccatat 
aaaaggatga 
ataaagtttc 
gttgaattac 
ggtttttatg 
gcgcgcaaac 
tgccggtctc 
gaatcggcca 
tcactgactc 
cggtaatacg 
gccagcaaaa 
gcccccctga 
gactataaag 
ccctgccgct 
aatgctcacg 
tgcacgaacc 
ccaacccggt 
gagcgaggta 
ctagaaggac 
ttggtagctc 
agcagcagat 
ggtctgacgc 
aaaggatctt 
tatatgagta 
cgatctgtct 
tacgggaggg 
cggctccaga 
ctgcaacttt 
gttcgccagt 
gctcgtcgtt 
gatcccccat 
gtaagttggc 
tcatgccatc 
aatagtgtat 
cacatagcag 
caaggatctt 
cttcagcatc 
ccgcaaaaaa 
aatattattg 
tttagaaaaa 
tctaagaaac 
ttcgtctcgc 
cggtcacagc 
cgggtgttgg 
gagtgcacca 
gtatcataca 



ct 9tggctgc 
ggagaccttc 
cacaaaatcc 
gctggccctg 
tgtgctggtt 
ggcatcccgg 
cgtggcagcc 
cctgccgctg 
caatgtgtct 
actttagaat 
ttaagattga 
gttaagcatg 
attagagtcc 
taggataaat 
cctatagtga 
acgcgcgggg 
gctgcgctcg 
gttatccaca 
ggccaggaac 
cgagcatcac 
ataccaggcg 
taccggatac 
ctgtaggtat 
ccccgttcag 
aagacacgac 
tgtaggcggt 
agtatttggt 
ttgatccggc 
tacgcgcaga 
tcagtggaac 
cacctagatc 
aacttggtct 
atttcgttca 
cttaccatct 
tttatcagca 
atccgcctcc 
taatagtttg 
tggtatggct 
gttgtgcaaa 
cgcagtgtta 
cgtaagatgc 
gcggcgaccg 
aactttaaaa 
accgctgttg 
ttttactttc 
gggaataagg 
aagcatttat 
taaacaaata 
cattattatc 
gcgtttcggt 
ttgtctgtaa 
cgggtgtcgg 
tatggacata 
catacgattt 



3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6313 



<210> 10 
<211> 22 
<212> PRT 
<213> Phaseolus 



vulgaris 
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<400> 10 

Met Ser Lys Pro Phe Leu Ser Leu Leu Ser Leu Ser Leu Leu Leu Phe 
15 10 15 

Thr Ser Thr Cys Leu Ala 
20 



<210> 11 
<211> 508 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: 
region of the plasmid pSHuJ 



Protein coding 



<400> 11 

aggatctatc 

gtttttatta 

aacaaatgta 

gaggacattg 

tctgatccca 

aaatgtgatc 

atctgtgatg 

acagctgtgg 

ccagatgcct 



gattcccggg 
aggctgttca 
agtgtgcccg 
tggagagaaa 
cctcaccatt 
ctacagaagt 
aagacagtgc 
tcccactcgt 
gctatcctga 



taccatggag 
tgtgaaagcc 
gattacttcc 
catccgaatt 
gagaaccaga 
ggagctggat 
tacagagacc 
atatggtggt 
ctgaattc 



aaccatttgc 
caagaagatg 
aggatcatcc 
attgttcctc 
tttgtgtacc 
aatcagatag 
tgctacactt 
gagaccaaaa 



ttttctgggg 
aaaggattgt 
gttcttccga 
tgaacaacag 
atttgtctga 
ttactgctac 
atgacagaaa 
tggtggaaac 



agtcctggcg 60 
tcttgttgac 120 
agatcctaat 180 
ggagaatatc 240 
cctctgtaaa 300 
ccagagcaat 360 
caagtgctac 420 
agccttaacc 480 
508 



<210> 12 
<211> 1845 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Protein coding 
region of plasmid pSHuSC 

<400> 12 

gtcgattccc gggtaccatg gtgctcttcg tgctcacctg cctgctggcg gtcttcccag 60 
ccatctccac gaagagtccc atatttggtc ccgaggaggt gaatagtgtg gaaggtaact 12 0 
cagtgtccat cacgtgctac tacccaccca cctctgtcaa ccggcacacc cggaagtact 180 
ggtgccggca gggagctaga ggtggctgca taaccctcat ctcctcggag ggctacgtct 24 0 
ccagcaaata tgcaggcagg gctaacctca ccaacttccc ggagaacggc acatttgtgg 3 00 
tgaacattgc ccagctgagc caggatgact ccgggcgcta caagtgtggc ctgggcatca 3 60 
atagccgagg cctgtccttt gatgtcagcc tggaggtcag ccagggtcct gggctcctaa 420 
atgacactaa agtctacaca gtggacctgg gcagaacggt gaccatcaac tgccctttca 480 
agactgagaa tgctcaaaag aggaagtcct tgtacaagca gataggcctg taccctgtgc 540 
tggtcatcga ctccagtggt tatgtgaatc ccaactatac aggaagaata cgccttgata 600 
ttcagggtac tggccagtta ctgttcagcg ttgtcatcaa ccaactcagg ctcagcgatg 660 
ctgggcagta tctctgccag gctggggatg attccaatag taataagaag aatgctgacc 72 0 
tccaagtgct aaagcccgag cccgagctgg tttatgaaga cctgaggggc tcagtgacct 780 
tccactgtgc cctgggccct gaggtggcaa acgtggccaa atttctgtgc cgacagagca 84 0 
gtggggaaaa ctgtgacgtg gtcgtcaaca ccctggggaa gagggcccca gcctttgagg 900 
gcaggatcct gctcaacccc caggacaagg atggctcatt cagtgtggtg atcacaggcc 960 
tgaggaagga ggatgcaggg cgctacctgt gtggagccca ttcggatggt cagctgcagg 102 0 
aaggctcgcc tatccaggcc tggcaactct tcgtcaatga ggagtccacg attccccgca 1080 
gccccactgt ggtgaagggg gtggcaggaa gctctgtggc cgtgctctgc ccctacaacc 1140 
gtaaggaaag caaaagcatc aagtactggt gtctctggga aggggcccag aatggccgct 1200 
gccccctgct ggtggacagc gaggggtggg ttaaggccca gtacgagggc cgcctctccc 1260 
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tgctggagga gccaggcaac ggcaccttca ctgtcatcct caaccagctc accagccggg 132 0 

acgccggctt ctactggtgt ctgaccaacg gcgatactct ctggaggacc accgtggaga 1380 

tcaagattat cgaaggagaa ccaaacctca aggttcccgg gaatgtcacg gctgtgctgg 1440 

gagagactct caaggtcccc tgtcactttc catgcaaatt ctcctcgtac gagaaatact 1500 

ggtgcaagtg gaataacacg ggctgccagg ccctgcccag ccaagacgaa ggccccagca 1560 

aggccttcgt gaactgtgac gagaacagcc ggcttgtctc cctgaccctg aacctggtga 162 0 

ccagggctga tgagggctgg tactggtgtg gagtgaagca gggccacttc tatggagaga 1680 

ctgcagccgt ctatgtggca gttgaagaga ggaaggcagc ggggtcccgc gatgtcagcc 1740 

tagcgaaggc agacgctgct cctgatgaga aggtgctaga ctctggtttt cgggagattg 1800 

agaacaaagc cattcaggat cccaggcttt ttgcagagtg aattc 1845 



<210> 13 
<211> 4465 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Express ion- type plasmid pBMSP-1 

<220> 

<221> modif ied_base 

<222> (2150) 

<223> a, c, t or g 

<220> 

<221> modif ied_base 
<222> (2214) . . (2215) 
<223> a, c, t or g 



<400> 13 

ctggccggcg 

ataaaccttt 

ggtttacccg 

caatctgatc 

gacaagccgt 

gatctgaatt 

gtttgcgcgc 

cataaaaacc 

gtaattcaac 

gaaactttat 

ccgctctaga 

atcgataccg 

gattggttat 

gaagcaaggc 

gcattccgtt 

gtcagcgtat 

acagaatacc 

aaattagtag 

gaaaaatctc 

tgaccagctc 

gtgcgatgcc 

cccacaacag 

gtcgataaga 

aggccaaatt 

gtgaaggtgg 

ctcaagggtc 

gtagatgtta 

gtacaatatt 

ccatcttgag 



ccagatctgg 
tcacgccctt 
ccaatatatc 
atgagcggag 
tttacgtttg 
aattcccgat 
tatattttgt 
catctcataa 
agaaattata 
tgccaaatgt 
actagtggat 
tcgacctcga 
gaaattcaga 
tatttatcca 
cttgctgtaa 
ctattcaaaa 
cgcgaaattc 
aaaaataaaa 
cctcgatcga 
gcggatccta 
ccccatcgta 
ctaccagttt 
aaaggcaatt 
cgctcttagc 
aaattaatga 
caccaaaaac 
acatccaacg 
actcaccggt 
accacaggcc 



ggaacctgtg 
ttaaatatcc 
ctgtcaaaca 
aattaaggga 
gaactgacag 
ctagtaacat 
tttctatcgc 
ataacgtcat 
tgataatcat 
ttgaacgatc 
cccccgggct 
gggggggccc 
tgctagtgta 
tttctgaaaa 
agcgttgttt 
gtcgttaatg 
aggcccggtt 
actgactcgg 
gaaagagatc 
caggccaaat 
ggtgaaggtg 
cctcaagggt 
tgtagatgtt 
cgtacaatat 
tccatcttga 
gtaagcgctt 
tcgctttcag 
gcgatgcccc 
cacaacagct 



gttggcatgc 
gattattcta 
ctgatagttt 
gtcacgttat 
aaccgcaacg 
agatgacacc 
gtattaaatg 
gcattacatg 
cgcaagaccg 
ggggaaattc 
gcaggaattc 
ggtaccccta 
atgtattggt 
ggcgaaatgg 
ggtacacttt 
gctgcggatc 
gccatgtcct 
atacttacgt 
aatgttgagc 
tcgctcttag 
gaaattaatg 
ccaccaaaaa 
aacatccaac 
tactcaccgg 
gaccacaggc 
acgtacatgg 
ggatcctaca 
ccatcgtagg 
accagtttcc 



acatacaaat 
ataaacgctc 
aaactgaagg 
gacccccgcc 
attgaaggag 
gcgcgcgata 
tataattgcg 
ttaattatta 
gcaacaggat 
gagctccacc 
gatcagatct 
gagtcgattt 
aatttgggaa 
cgtcaccgcg 
tgactagcga 
aagaaaaagt 
acacgccgaa 
cacgtcttgc 
tgcttcaaaa 
ccgtacaata 
atccatcttg 
cgtaagcgct 
gtcgctttca 
tgcgatgccc 
ccacaacagc 
tcgataagaa 
ggccaaattc 
tgaaggtgga 
tcaagggtcc 



ggacgaacgg 
ttttctctta 
cgggaaacga 
gatgacgcgg 
ccactcagcc 
atttatccta 
ggactctaat 
catgcttaac 
tcaatcttaa 
gcggtggcgg 
gatcaagctt 
ggtgtatcga 
gatataatag 
agcgtcacgc 
ggcttggcgt 
tggaatagaa 
ataaacgacc 
gcactgattt 
gcaatgggat 
ttactcaccg 
agaccacagg 
tacgtacatg 
gggatcctac 
cccatcgtag 
taccagtttc 
aaggcaattt 
gctcttagcc 
aattaatgat 
accaaaaacg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 
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taagcgctta 
cgctttcagg 
gaatattatt 
aattattaat 
catcaataga 
atagttgttt 
gcaagggctg 
cccggatgaa 
aagcaggtag 
gcaagcgaac 
gtaaactgga 
gcggagaatt 
cgtttggaac 
ttaatgagct 
cactatcagc 
ctcggtatcc 
attgaacaag 
tatgactggg 
caggggcgcc 
gacgaggcag 
gacgttgtca 
ctcctgtcat 
cggctgcata 
gagcgagcac 
catcaggggc 
gatgatctcg 
cgcttttctg 
gcgttggcta 
gtgctttacg 
gagttcttct 
tttatgtatt 
ataatcagtt 
ttggactata 
agatgggaat 
ttacccaact 
aggcccgcac 
gctttcttcc 
gggctccctt 
ttgggtgatg 
ttggagtcca 
atctcgggct 
aggattttcg 
aggcggtgaa 
tacattaaaa 
caatatatcc 
ccactcgata 



cgtacatggt 
gatccgcgag 
ataataatat 
aatatatata 
agactacgta 
catatccaac 
ctaaaggaag 
tgtcagctac 
cttgcagtgg 
cggaattgcc 
tggctttctt 
aagggagtca 
tgacagaacc 
aagcacatac 
tagcaaatat 
aattagagtc 
atggattgca 
cacaacagac 
cggttctttt 
cgcggctatc 
ctgaagcggg 
ctcaccttgc 
cgcttgatcc 
gtactcggat 
tcgcgccagc 
tcgtgaccca 
gattcatcga 
cccgtgatat 
gtatcgccgc 
gagcgggact 
acacataata 
attgaaatat 
atacctgact 
taattcactg 
taatcgcctt 
cgatcgccct 
cttcctttct 
tagggttccg 
gttcacgtag 
cgttctttaa 
attcttttga 
cctgctgggg 
gggcaatcag 
acgtccgcaa 
tgccaccagc 
caggcagccc 



cgataagaaa 
cttatcgcga 
aaaatatata 
ttaatcattt 
taaaaataat 
atgatgtcca 
cggaacacgt 
tgggctatct 
gcttacatgg 
agctggggcg 
gccgccaagg 
cgttatgacc 
gcaacgttga 
gtcagaaacc 
ttcttgtcaa 
tcatattcac 
cgcaggttct 
aatcggctgc 
tgtcaagacc 
gtggctggcc 

aa 999 act 99 
tcctgccgag 
ggctacctgc 
ggaagccggt 
cgaactgttc 
tggcgatgcc 
ctgtggccgg 
tgctgaagag 
tcccgattcg 
ctgaggatcc 
tcgcactcag 
ttctgaattt 
tgttatttta 
gccgtcgttt 
gcagcacatc 
tcccaacagt 
cgccacgttc 
atttagtgct 
tgggccatcg 
tagtggactc 
tttataaggg 
caaaccagcg 
ctgttgcccg 
tgtgttatta 
cagccaacag 
atcag 
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aggcaatttg 
taccgtcgaa 
tatataaatt 
agatatataa 
cagataacat 
gagcttcacg 
agaaagccag 
ggacaaggga 
cgatagctag 
ccctctggta 
atctgatggc 
cccgccgatg 
aggagccact 
attattgcgc 
aaatgctcca 
tctcaatcca 
ccggccgctt 
tctgatgccg 
gacctgtccg 
acgacgggcg 
ctgctattgg 
aaagtatcca 
ccattcgacc 
cttgtcgatc 
gccaggctca 
tgcttgccga 
ctgggtgtgg 
cttggcggcg 
cagcgcatcg 
cccgatgagc 
tctttcatct 
aaacttgcat 
tcaataaata 
tacaacgtcg 
cccctttcgc 
tgcgcagcct 
gccggctttc 
ttacggcacc 
ccctgataga 
ttgttccaaa 
attttgccga 
tggaccgctt 
tctcactggt 
agttgtctaa 
ctccccgacc 



tagatgttaa 
tataataatt 
ataatatatt 
ttctatagcc 
ctaaaacatg 
ctgccgcaag 
tccgcagaan 
aaacgcaagc 
actgggcggt 
aggttgggaa 
gcaggggatc 
acgcgggaca 
cagccgcggg 
gttcaaaagt 
ctgacgttcc 
gatctggatc 
gggtggagag 
ccgtgttccg 
gtgccctgaa 
ttccttgcgc 
gcgaagtgcc 
tcatggctga 
accaagcgaa 
aggatgatct 
aggcgcgcat 
atatcatggt 
cggaccgcta 
aatgggctga 
ccttctatcg 
taagctagct 
acggcaatgt 
caataaattt 
tttaaactat 
tgactgggaa 
cagctggcgt 
gaatggcgcc 
cccgtcaagc 
tcgaccccaa 
cggtttttcg 
ctggaacaac 
tttcggaacc 
gctgcaactc 
gaaaagaaaa 
gcgtcaattt 
ggcagctcgg 



catccaacgt 
atatttgtaa 
aattattgtt 
ttagactcct 
tagataaata 
cactcagggc 
cggtgctgac 
gcannagaga 
tttatggaca 
gccctgcaaa 
aagatcatga 
agccgtttta 
tttctggagt 
cgcctaaggt 
ataaattccc 
gtttcgcatg 
gctattcggc 
gctgtcagcg 
tgaactgcag 
agctgtgctc 
ggggcaggat 
tgcaatgcgg 
acatcgcatc 
ggacgaagag 
gcccgacggc 
ggaaaatggc 
tcaggacata 
ccgcttcctc 
ccttcttgac 
atatcatcaa 
accagctgat 
atgtttttgc 
atttctttca 
aaccctggcg 
aatagcgaag 
cgctcctttc 
tctaaatcgg 
aaaacttgat 
ccctttgacg 
actcaaccct 
accatcaaac 
tctcagggcc 
accaccccag 
gtttacacca 
cacaaaatca 



1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4465 



<210> 14 
<211> 8074 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Expression-type plasmid pBMSP- IspJSC 

<220> 

<221> modif ied_base 

<222> (2315) 

<223> a, c, t or g 
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<400> 14 

ctgatgggct 

ggctggctgg 

acacattgcg 

aacagctgat 

gtttgcccca 

ataaatcaaa 

cactattaaa 

gcccactacg 

taaatcggaa 

tggcgagaaa 

tgggaagggc 

gctgcaaggc 

acggccagtg 

taacaagtca 

cagaaatatt 

tgcgatatta 

ctcagagtcc 

cgggagcggc 

cagcaatatc 

cacagtcgat 

cgccatgggt 

gttcggctgg 

cttccatccg 

tagccggatc 

caggagcaag 

cccttcccgc 

gccacgatag 

tgacaaaaag 

cgattgtctg 

ctgcgtgcaa 

tatgagactc 

aagaaatatt 

ctgacgtatg 

gttgcggttc 

taacgtgact 

gcggcaagaa 

cagctggcaa 

taagcccact 

cccagtagct 

tccgcttcct 

tcatgttgga 

tttatacgta 

ttaatatata 

ttttatatta 

taagctcgcg 

atcgaccatg 

tgtgggcctg 

tcgcaccggt 

gcgacgttgg 

ttacgttttt 

tggatcatta 

tacggctaag 

tacaaattgc 

gaggaaactg 

cacctacgat 

cctgtaggat 

tttctcgatc 

cagtttttat 

gcctgaattt 



gcctgtatcg 
tggcaggata 
gacgttttta 
tgcccttcac 
gcaggcgaaa 
agaatagccc 
gaacgtggac 
tgaaccatca 
ccctaaaggg 
ggaagggaag 
gatcggtgcg 
gattaagttg 
aattaattcc 
ggtattatag 
tcaataactg 
tgtgtaatac 
cgctcagaag 
gataccgtaa 
acgggtagcc 
gaatccagaa 
cacgacgaga 
cgcgagcccc 
agtacgtgct 
aagcgtatgc 
gtgagatgac 
ttcagtgaca 
ccgcgctgcc 
aaccgggcgc 
ttgtgcccag 
tccatcttgt 
taattggata 
tgctagctga 
tgcttagctc 
tgtcagttcc 
cccttaattc 
agccatccag 
ttccggttcg 
gcaagctacc 
gacattcatc 
ttagcagccc 
tatgaaacaa 
gtcttctatt 
tattattaat 
ttataataat 
gatccctgaa 
tacgtaagcg 
tggtctcaag 
gagtaatatt 
atgttaacat 

ggtggaccct 

atttccacct 
agcgaatttg 
cttttcttat 
gtagctgttg 

ggggggcatc 

ccgcgagctg 
gagggagatt 
ttttctacta 
cgcgggtatt 



agtggtgatt 
tattgtggtg 
atgtactggg 
cgcctggccc 
atcctgtttg 
gagatagggt 
tccaacgtca 
cccaaatcaa 
agcccccgat 
aaagcgaaag 
ggcctcttcg 
ggtaacgcca 
catcttgaaa 
tccaagcaaa 
attatatcag 
ataaattgat 
aactcgtcaa 
agcacgagga 
aacgctatgt 
aagcggccat 
tcatcgccgt 
tgatgctctt 
cgctcgatgc 
agccgccgca 
aggagatcct 
acgtcgagca 
tcgtcctgca 
ccctgcgctg 
tcatagccga 
tcaatcatgc 
ccgaggggaa 
tagtgacctt 
attaaactcc 
aaacgtaaaa 
tccgctcatg 
tttactttgc 
cttgctgtcc 
tgctttctct 
cggggtcagc 
ttgcgccctg 
ctattattta 
gatgaggagt 
aattaacaat 
attcttacaa 
agcgacgttg 
cttacgtttt 
atggatcatt 
gtacggctaa 
ctacaaattg 
tgaggaaact 
tcacctacga 
gcctgtagga 
cgaccatgta 
tgggcctgtg 
gcaccggtga 
gtcaatccca 
tttcaaatca 
atttggtcgt 
ctgtttctat 



ttgtgccgag 
taaacaaatt 
gtggtttttc 
tgagagagtt 
atggtggttc 
tgagtgttgt 
aagggcgaaa 
gttttttggg 
ttagagcttg 
gagcgggcgc 
ctattacgcc 
gggttttccc 
gaaatatagt 
aacataaatt 
ctggtacatt 
gatatagcta 
gaaggcgata 
agcggtcagc 
cctgatagcg 
tttccaccat 
cgggcatgcg 
cgtccagatc 
gatgtttcgc 
ttgcatcagc 
gccccggcac 
cagctgcgca 
gttcattcag 
acagccggaa 
atagcctctc 
gaaacgatcc 
tttatggaac 
aggcgacttt 
agaaacccgc 
cggcttgtcc 
atcttgatcc 
agggcttccc 
ataaaaccgc 
ttgcgcttgc 
accgnttctg 
agtgcttgcg 
tctacatgtt 
ctaaggctat 
aattaatata 
atataattat 
gatgttaaca 
tggtggaccc 
aatttccacc 
gagcgaattt 
ccttttctta 
ggtagctgtt 
tggggggcat 
tccctgaaag 
cgtaagcgct 
gtctcaagat 
gtaatattgt 
ttgcttttga 
gtgcgcaaga 
ttatttcggc 
tccaactttt 



ctgccggtcg 
gacgcttaga 
ttttcaccag 
gcagcaagcg 
cgaaatcggc 
tccagtttgg 
aaccgtctat 
gtcgaggtgc 
acggggaaag 
cattcaggct 
agctggcgaa 
agtcacgacg 
ttaaatattt 
tattgatgca 
gccgtagatg 
gcttagctca 
gaaggcgatg 
ccattcgccg 
gtccgccaca 
gatattcggc 
cgccttgagc 
atcctgatcg 
ttggtggtcg 
catgatggat 
ttcgcccaat 
aggaacgccc 
ggcaccggac 
cacggcggca 
cacccaagcg 
agatctggat 
gtcagtggag 
tgaacgcgca 
ggctgagtgg 
cgcgtcatcg 
cctgcgccat 
aaccttacca 
ccagtctagc 
gttttccctt 
cggactggct 
gcagcgtgaa 
ttagatgtta 
agaattatat 
ttataattta 
tatattcgac 
tctacaaatt 
ttgaggaaac 
ttcacctacg 
ggcctgtagg 
tcgaccatgt 
gtgggcctgt 
cgcaccggtg 
cgacgttgga 
tacgtttttg 
ggatcattaa 
acggctaaga 
agcagctcaa 
cgtgacgtaa 
gtgtaggaca 
tcttgatccg 



gggagctgtt 
caacttaata 
tgagacgggc 
gtccacgctg 
aaaatccctt 
aacaagagtc 

cagggcgatg 

cgtaaagcac 
ccggcgaacg 
gcgcaactgt 
agggggatgt 
ttgtaaaacg 
attgataaaa 
agtttaaatt 
aaagactgag 
tcgggggatc 
cgctgcgaat 
ccaagctctt 
cccagccggc 
aagcaggcat 
ctggcgaaca 
acaagaccgg 
aatgggcagg 
actttctcgg 
agcagccagt 
gtcgtggcca 
aggtcggtct 
tcagagcagc 
gccggagaac 
tgagagtgaa 
catttttgac 
ataatggttt 
ctccttcaac 
gcgggggtca 
cagatccttg 
gagggcgccc 
tatcgccatg 
gtccagatag 
ttctacgtgt 
gctctggaca 
tctgattatt 
atctaaatga 
tatatatata 
ggtatcgcga 
gccttttctt 
tggtagctgt 
atggggggca 
atccctgaaa 
acgtaagcgc 
ggtctcaaga 
agtaatattg 
tgttaacatc 
gtggaccctt 
tttccacctt 
gcgaatttgg 
cattgatctc 
gtatccgagt 
tggcaaccgg 
cagccattaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 
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cgacttttga 
cgctttacag 
gaaatggata 
agcatctgaa 
ctcttcgtgc 
tttggtcccg 
ccacccacct 
ggctgcataa 
aacctcacca 
gatgactccg 
gtcagcctgg 
gacctgggca 
aagtccttgt 
gtgaatccca 
ttcagcgttg 
ggggatgatt 
gagctggttt 
gtggcaaacg 
gtcaacaccc 
gacaaggatg 
tacctgtgtg 
caactcttcg 
gcaggaagct 
tactggtgtc 
gggtgggtta 
accttcactg 
accaacggcg 
aacctcaagg 
cactttccat 
tgccaggccc 
aacagccggc 
tggtgtggag 
gaagagagga 
gatgagaagg 
aggctttttg 
ttgaatcctg 
catgtaataa 
gtcccgcaat 
aaattatcgc 
aaggatccct 
atgtacgtaa 
ctgtggtctc 
ggtgagtaat 
tggatgttaa 
tttggtggac 
ttaatttcca 
aagagcgaat 
tgccttttct 
ctggtagctg 
gatggggggc 
gatccgcgag 
agggagattt 
tttctactaa 
gcgggtattc 
tagatacgct 
aagaacggaa 
atagccttgc 
ttcataacca 
cttttttgtc 
aagatgaaag 
tcatccgttc 



atagatacgc 
caagaacgga 
aatagccttg 
tttcataacc 
tcacctgcct 
aggaggtgaa 
ctgtcaaccg 
ccctcatctc 
acttcccgga 
ggcgctacaa 
aggtcagcca 
gaacggtgac 
acaagcagat 
actatacagg 
tcatcaacca 
ccaatagtaa 
atgaagacct 
tggccaaatt 
tggggaagag 
gctcattcag 
gagcccattc 
tcaatgagga 
ctgtggccgt 
tctgggaagg 
aggcccagta 
tcatcctcaa 
atactctctg 
ttcccgggaa 
gcaaattctc 
tgcccagcca 
ttgtctccct 
tgaagcaggg 
aggcagcggg 
tgctagactc 
cagagtgaat 
ttgccggtct 
ttaacatgta 
tatacattta 
gcgcggtgtc 
gaaagcgacg 
gcgcttacgt 
aagatggatc 
attgtacggc 
catctacaaa 
ccttgaggaa 
ccttcaccta 
ttggcctgta 
tatcgaccat 
ttgtgggcct 
atcgcaccgg 
ctggtcaatc 
ttcaaatcag 
tttggtcgtt 
tgtttctatt 
gacacgccaa 
tgcgcgtgac 
ttcctattat 
atctcgatac 
tcttctttca 
gattgttctt 
ttccgaagat 



tgacacgcca 
atgcgcgtga 
cttcctatta 
aatctcgata 
gctggcggtc 
tagtgtggaa 
gcacacccgg 
ctcggagggc 
gaacggcaca 
gtgtggcctg 
gggtcctggg 
catcaactgc 
aggcctgtac 
aagaatacgc 
actcaggctc 
taagaagaat 
gaggggctca 
tctgtgccga 
ggccccagcc 
tgtggtgatc 
ggatggtcag 
gtccacgatt 
gctctgcccc 
ggcccagaat 
cgagggccgc 
ccagctcacc 
gaggaccacc 
tgtcacggct 
ctcgtacgag 
agacgaaggc 
gaccctgaac 
ccacttctat 
gtcccgcgat 
tggttttcgg 
tcccgatcgt 
tgcgatgatt 
atgcatgacg 
atacgcgata 
atctatgtta 
ttggatgtta 
ttttggtgga 
attaatttcc 
taagagcgaa 
ttgccttttc 
actggtagct 
cgatgggggg 
ggatccctga 
gtacgtaagc 
gtggtctcaa 
tgagtaatat 
ccattgcttt 
tgcgcaagac 
tatttcggcg 
ccaacttttt 
gcctcgctag 
gctcgcggtg 
atcttcccaa 
accaaatcga 
ttgtcattgc 
gttgacaaca 
cctaatgagg 
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agcctcgcta 
cgctcgcggt 
tatcttccca 
caccaaatcg 
ttcccagcca 
ggtaactcag 
aagtactggt 
tacgtctcca 
tttgtggtga 
ggcatcaata 
ctcctaaatg 
cctttcaaga 
cctgtgctgg 
cttgatattc 
agcgatgctg 
gctgacctcc 
gtgaccttcc 
cagagcagtg 
tttgagggca 
acaggcctga 
ctgcaggaag 
ccccgcagcc 
tacaaccgta 
ggccgctgcc 
ctctccctgc 
agccgggacg 
gtggagatca 
gtgctgggag 
aaatactggt 
cccagcaagg 
ctggtgacca 
ggagagactg 
gtcagcctag 
gagattgaga 
tcaaacattt 
atcatataat 
ttatttatga 
gaaaacaaaa 
ctagatcggg 
acatctacaa 
cccttgagga 
accttcacct 
tttggcctgt 
ttatcgacca 
gttgtgggcc 
catcgcaccg 
aagcgacgtt 
gcttacgttt 
gatggatcat 
tgtacggcta 
tgaagcagct 
gtgacgtaag 
tgtaggacat 
cttgatccgc 
tcaaaagtgt 
acgccatttc 
attaccaata 
ctctagagga 
ttttgtttac 
aatgtaagtg 
acattgtgga 



gtcaaaagtg 
gacgccattt 
aattaccaat 
actctagggg 
tctccacgaa 
tgtccatcac 
gccggcaggg 
gcaaatatgc 
acattgccca 
gccgaggcct 
acactaaagt 
ctgagaatgc 
tcatcgactc 
agggtactgg 
ggcagtatct 
aagtgctaaa 
actgtgccct 
gggaaaactg 
ggatcctgct 
ggaaggagga 
gctcgcctat 
ccactgtggt 
aggaaagcaa 
ccctgctggt 
tggaggagcc 
ccggcttcta 
agattatcga 
agactctcaa 
gcaagtggaa 
ccttcgtgaa 
gggctgatga 
cagccgtcta 
cgaaggcaga 
acaaagccat 
ggcaataaag 
ttctgttgaa 
gatgggtttt 
tatagcgcgc 
gatccgtcga 
attgcctttt 
aactggtagc 
acgatggggg 
aggatccctg 
tgtacgtaag 
tgtggtctca 
gtgagtaata 
ggatgttaac 
ttggtggacc 
taatttccac 
agagcgaatt 
caacattgat 
tatccgagtc 
ggcaaccggg 
agccattaac 
accaaacaac 
gccttttcag 
cattacacta 
tctaaccatg 
atctactagt 
tgcccggatt 
gagaaacatc 



taccaaacaa 
cgccttttca 
acattacact 
taccatggtg 
gagtcccata 
gtgctactac 
agctagaggt 
aggcagggct 
gctgagccag 
gtcctttgat 
ctacacagtg 
tcaaaagagg 
cagtggttat 
ccagttactg 
ctgccaggct 
gcccgagccc 
gggccctgag 
tgacgtggtc 
caacccccag 
tgcagggcgc 
ccaggcctgg 
gaagggggtg 
aagcatcaag 
ggacagcgag 
aggcaacggc 
ctggtgtctg 
aggagaacca 
ggtcccctgt 
taacacgggc 
ctgtgacgag 
gggctggtac 
tgtggcagtt 
cgctgctcct 
tcaggatccc 
tttcttaaga 
ttacgttaag 
tatgattaga 
aaactaggat 
cggtatcgat 
cttatcgacc 
tgttgtgggc 
gcatcgcacc 
aaagcgacgt 
cgcttacgtt 
agatggatca 
ttgtacggct 
atctacaaat 
cttgaggaaa 
cttcacctac 
tggcctgtag 
ctctttctcg 
agtttttatt 
cctgaatttc 
gacttttgaa 
gctttacagc 
aaatggataa 
gcatctgaat 
ggatctaaac 
ttggcacaag 
acttccagga 
cgaattattg 



3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
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ttcctctgaa 
tgtaccattt 
agatagttac 
acacttatga 
ccaaaatggt 
ccccgatcgt 
tgcgatgatt 
atgcatgacg 
atacgcgata 
atctatgtta 
ggttctgtca 
tgactccctt 
tcagtgtttg 
aatcggatat 
caaccacagg 



caacagggag 
gtctgacctc 
tgctacccag 
cagaaacaag 
ggaaacagcc 
tcaaacattt 
atcatataat 
ttatttatga 
gaaaacaaaa 
ctagatcggg 
gttccaaacg 
aattctccgc 
acaggatata 
ttaaaagggc 
ttccccagat 



aatatctctg 
tgtaaaaaat 
agcaatatct 
tgctacacag 
ttaaccccag 
ggcaataaag 
ttctgttgaa 
gatgggtttt 
tatagcgcgc 
aattaattca 
taaaacggct 
tcatgatcag 
ttggcgggta 
gtgaaaaggt 
ctggcgccgg 
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atcccacctc 
gtgatcctac 
gtgatgaaga 
ctgtggtccc 
atgcctgcta 
tttcttaaga 
ttacgttaag 
tatgattaga 
aaactaggat 
gatcggctga 
tgtcccgcgt 
attgtcgttt 
aacctaagag 
ttatccgttc 
ccag 



accattgaga 
agaagtggag 
cagtgctaca 
actcgtatat 
tcctgactga 
ttgaatcctg 
catgtaataa 
gtcccgcaat 
aaattatcgc 
gtggctcctt 
catcggcggg 
cccgccttca 
aaaagagcgt 
gtccatttgt 



accagatttg 
ctggataatc 
gagacctgct 
ggtggtgaga 
gctcgaattt 
ttgccggtct 
ttaacatgta 
tatacattta 
gcgcggtgtc 
caatcgttgc 
ggtcataacg 
gtttaaacta 
ttattagaat 
atgtgcatgc 



7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8040 
8074 



<210> 15 

<211> 1062 

<212> DNA 

<213> Homo sapiens 



<400> 15 

gcatccccga 

aacgtggtca 

tggagcgaaa 

ggggacctgt 

aagtccgtga 

tgcccagttc 

tcatgctgcc 

tcagaagcga 

acctggacgc 

ggctgctaca 

accttcactt 

aaatccggaa 

gccctgaacg 

ctggttcgct 

tcccggcagg 

gcagccgagg 

ccgctggcct 

gtgtctgttg 



ccagccccaa 
tcgcctgcct 
gcggacaggg 
acaccacgag 
catgccacgt 
cctcaactcc 
acccccgact 
acctcacgtg 
cctcaagtgg 
gcgtgtccag 
gcactgctgc 
acacattccg 
agctggtgac 
ggctgcaggg 
agcccagcca 
actggaagaa 
tcacacagaa 
tcatggcgga 



ggtcttcccg 
ggtccagggc 
cgtgaccgcc 
cagccagctg 
gaagcactac 
acctacccca 
gtcactgcac 
cacactgacc 
gaagagcgct 
tgtcctgccg 
ctaccccgag 
gcccgaggtc 
gctgacgtgc 
gtcacaggag 
gggcaccacc 
gggggacacc 
gaccatcgac 
ggtggacggc 



ctgagcctct 
ttcttccccc 
agaaacttcc 
accctgccgg 
acgaatccca 
tctccctcaa 
cgaccggccc 
ggcctgagag 
gttcaaggac 
ggctgtgccg 
tccaagaccc 
cacctgctgc 
ctggcacgcg 
ctgccccgcg 
accttcgctg 
ttctcctgca 
cgcttggcgg 
acctgctact 



gcagcaccca 
aggagccact 
cacccagcca 
ccacacagtg 
gccaggatgt 
ctccacctac 
tcgaggacct 
atgcctcagg 
cacctgagcg 
agccatggaa 
cgctaaccgc 
cgccgccgtc 
gcttcagccc 
agaagtacct 
tgaccagcat 
tggtgggcca 
gtaaacccac 
ga 



gccagatggg 
cagtgtgacc 
ggatgcctcc 
cctagccggc 
gactgtgccc 
cccatctccc 
gctcttaggt 
tgtcaccttc 
tgacctctgt 
ccatgggaag 
caccctctca 
ggaggagctg 
caaggacgtg 
gacttgggca 
actgcgcgtg 
cgaggccctg 
ccatgtcaat 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1062 



<210> 16 

<211> 353 

<212> PRT 

<213> Homo sapiens 

<400> 16 

Ala Ser Pro Thr Ser Pro Lys Val Phe Pro Leu Ser Leu Cys Ser Thr 
1 5 10 15 

Gin Pro Asp Gly Asn Val Val lie Ala Cys Leu Val Gin Gly Phe Phe 
20 25 30 

Pro Gin Glu Pro Leu Ser Val Thr Trp Ser Glu Ser Gly Gin Gly Val 
35 40 45 



Thr Ala Arg Asn Phe Pro Pro Ser Gin Asp Ala Ser Gly Asp Leu Tyr 
50 55 60 
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Thr Thr Ser Ser Gin Leu Thr Leu Pro Ala Thr Gin Cys Leu Ala Gly 
65 70 75 80 

Lys Ser Val Thr Cys His Val Lys His Tyr Thr Asn Pro Ser Gin Asp 
85 90 95 

Val Thr Val Pro Cys Pro Val Pro Ser Thr Pro Pro Thr Pro Ser Pro 
100 105 110 

Ser Thr Pro Pro Thr Pro Ser Pro Ser Cys Cys His Pro Arg Leu Ser 
115 120 125 

Leu His Arg Pro Ala Leu Glu Asp Leu Leu Leu Gly Ser Glu Ala Asn 
130 135 140 

Leu Thr Cys Thr Leu Thr Gly Leu Arg Asp Ala Ser Gly Val Thr Phe 
145 150 155 160 

Thr Trp Thr Pro Ser Ser Gly Lys Ser Ala Val Gin Gly Pro Pro Glu 
165 170 175 

Arg Asp Leu Cys Gly Cys Tyr Ser Val Ser Ser Val Leu Pro Gly Cys 
180 185 190 

Ala Glu Pro Trp Asn His Gly Lys Thr Phe Thr Cys Thr Ala Ala Tyr 
195 200 205 

Pro Glu Ser Lys Thr Pro Leu Thr Ala Thr Leu Ser Lys Ser Gly Asn 
210 215 220 

Thr Phe Arg Pro Glu Val His Leu Leu Pro Pro Pro Ser Glu Glu Leu 
225 230 235 240 

Ala Leu Asn Glu Leu Val Thr Leu Thr Cys Leu Ala Arg Gly Phe Ser 
245 250 255 

Pro Lys Asp Val Leu Val Arg Trp Leu Gin Gly Ser Gin Glu Leu Pro 
260 265 270 

Arg Glu Lys Tyr Leu Thr Trp Ala Ser Arg Gin Glu Pro Ser Gin Gly 
275 280 285 

Thr Thr Thr Phe Ala Val Thr Ser lie Leu Arg Val Ala Ala Glu Asp 
290 295 300 

Trp Lys Lys Gly Asp Thr Phe Ser Cys Met Val Gly His Glu Ala Leu 
305 310 315 320 

Pro Leu Ala Phe Thr Gin Lys Thr lie Asp Arg Leu Ala Gly Lys Pro 
325 330 335 

Thr His Val Asn Val Ser Val Val Met Ala Glu Val Asp Gly Thr Cys 
340 345 350 

Tyr 

<210> 17 

<211> 1023 

<212> DNA 

<213> Homo sapiens 
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<400> 17 

gcatccccga 

aacgtggtcg 

tggagcgaaa 

ggggacctgt 

aagtccgtga 

tgcccagttc 

ctcgaggacc 

gatgcctctg 

ccacctgagc 

cagccatgga 

ccactaaccg 

ccgccgccgt 

ggcttcagcc 

gagaagtacc 

gtgaccagca 

atggtgggcc 

ggtaaaccca 

tga 



ccagccccaa 
tcgcatgcct 
gcggacagaa 
acaccacgag 
catgccacgt 
ccccacctcc 
tgctcttagg 
gtgccacctt 
gtgacctctg 
accatgggga 
ccaacatcac 
cggaggagct 
ccaaggatgt 
tgacttgggc 
tactgcgcgt 
acgaggccct 
cccatgtcaa 



ggtcttcccg 
ggtccagggc 
cgtgaccgcc 
cagccagctg 
gaagcactac 
cccatgctgc 
ttcagaagcg 
cacctggacg 
tggctgctac 
gaccttcacc 
aaaatccgga 
ggccctgaac 
gctggttcgc 
atcccggcag 
ggcagccgag 
gccgctggcc 
tgtgtctgtt 



ctgagcctcg 
ttcttccccc 
agaaacttcc 
accctgccgg 
acgaatccca 
cacccccgac 
aacctcacgt 
ccctcaagtg 
agcgtgtcca 
tgcactgctg 
aacacattcc 
gagctggtga 
tggctgcagg 
gagcccagcc 
gactggaaga 
ttcacacaga 
gtcatggcgg 



acagcacccc 
aggagccact 
cacctagcca 
ccacacagtg 
gccaggatgt 
tgtcgctgca 
gcacactgac 
ggaagagcgc 
gtgtcctgcc 
cccaccccga 
ggcccgaggt 
cgctgacgtg 
ggtcacagga 
agggcaccac 
agggggacac 
agaccatcga 
aggtggacgg 



ccaagatggg 
cagtgtgacc 
ggatgcctcc 
cccagacggc 
gactgtgccc 
ccgaccggcc 
cggcctgaga 
tgttcaagga 
tggctgtgcc 
gttgaagacc 
ccacctgctg 
cctggcacgt 
gctgccccgc 
caccttcgct 
cttctcctgc 
ccgcttggcg 
cacctgctac 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1023 



<210> 18 
<211> 340 
<212> PRT 

<213> Homo sapiens 
<400> 18 

Ala Ser Pro Thr Ser Pro Lys Val Phe Pro Leu Ser Leu Asp Ser Thr 
15 10 15 

Pro Gin Asp Gly Asn Val Val Val Ala Cys Leu Val Gin Gly Phe Phe 
20 25 30 

Pro Gin Glu Pro Leu Ser Val Thr Trp Ser Glu Ser Gly Gin Asn Val 
35 40 45 

Thr Ala Arg Asn Phe Pro Pro Ser Gin Asp Ala Ser Gly Asp Leu Tyr 
50 55 60 

Thr Thr Ser Ser Gin Leu Thr Leu Pro Ala Thr Gin Cys Pro Asp Gly 
65 70 75 80 

Lys Ser Val Thr Cys His Val Lys His Tyr Thr Asn Pro Ser Gin Asp 
85 90 95 

Val Thr Val Pro Cys Pro Val Pro Pro Pro Pro Pro Cys Cys His Pro 
100 105 110 

Arg Leu Ser Leu His Arg Pro Ala Leu Glu Asp Leu Leu Leu Gly Ser 
115 120 125 

Glu Ala Asn Leu Thr Cys Thr Leu Thr Gly Leu Arg Asp Ala Ser Gly 
13 0 13 5 14 0 

Ala Thr Phe Thr Trp Thr Pro Ser Ser Gly Lys Ser Ala Val Gin Gly 
145 150 155 160 



Pro Pro Glu Arg Asp Leu Cys Gly Cys Tyr Ser Val Ser Ser Val Leu 
165 170 175 



WO 03/064992 PCT/US02/34197 

18/119 

Pro Gly Cys Ala Gin Pro Trp Asn His Gly Glu Thr Phe Thr Cys Thr 
180 185 190 

Ala Ala His Pro Glu Leu Lys Thr Pro Leu Thr Ala Asn lie Thr Lys 
195 200 205 

Ser Gly Asn Thr Phe Arg Pro Glu Val His Leu Leu Pro Pro Pro Ser 
210 215 220 

Glu Glu Leu Ala Leu Asn Glu Leu Val Thr Leu Thr Cys Leu Ala Arg 
225 230 235 240 

Gly Phe Ser Pro Lys Asp Val Leu Val Arg Trp Leu Gin Gly Ser Gin 
245 250 255 

Glu Leu Pro Arg Glu Lys Tyr Leu Thr Trp Ala Ser Arg Gin Glu Pro 
260 265 270 

Ser Gin Gly Thr Thr Thr Phe Ala Val Thr Ser lie Leu Arg Val Ala 
275 280 285 

Ala Glu Asp Trp Lys Lys Gly Asp Thr Phe Ser Cys Met Val Gly His 
290 295 300 

Glu Ala Leu Pro Leu Ala Phe Thr Gin Lys Thr lie Asp Arg Leu Ala 
305 310 315 320 

Gly Lys Pro Thr His Val Asn Val Ser Val Val Met Ala Glu Val Asp 
325 330 335 

Gly Thr Cys Tyr 
340 



<210> 19 
<211> 993 
<212> DNA 

<213> Homo sapiens 
<400> 19 

gcctccacca agggcccatc ggtcttcccc ctggcaccct cctccaagag cacctctggg 60 
ggcacagcgg ccctgggctg cctggtcaag gactacttcc ccgaaccggt gacggtgtcg 12 0 
tggaactcag gcgccctgac cagcggcgtg cacaccttcc cggctgtcct acagtcctca 180 
ggactctact ccctcagcag cgtggtgacc gtgccctcca gcagcttggg cacccagacc 240 
tacatctgca acgtgaatca caagcccagc aacaccaagg tggacaagaa agttgagccc 3 00 
aaatcttgtg acaaaactca cacatgccca ccgtgcccag cacctgaact cctgggggga 360 
ccgtcagtct tcctcttccc cccaaaaccc aaggacaccc tcatgatctc ccggacccct 420 
gaggtcacat gcgtggtggt ggacgtgagc cacgaagacc ctgaggtcaa gttcaactgg 480 
tacgtggacg gcgtggaggt gcataatgcc aagacaaagc cgcgggagga gcagtacaac 54 0 
agcacgtacc gggtggtcag cgtcctcacc gtcctgcacc aggactggct gaatggcaag 600 
gagtacaagt gcaaggtctc caacaaagcc ctcccagccc ccatcgagaa aaccatctcc 660 
aaagccaaag ggcagccccg agaaccacag gtgtacaccc tgcccccatc ccgggatgag 720 
ctgaccaaga accaggtcag cctgacctgc ctggtcaaag gcttctatcc cagcgacatc 780 
gccgtggagt gggagagcaa tgggcagccg gagaacaact acaagaccac gcctcccgtg 840 
ctggactccg acggctcctt cttcctctac agcaagctca ccgtggacaa gagcaggtgg 900 
cagcagggga acgtcttctc atgctccgtg atgcatgagg ctctgcacaa ccactacacg 960 
cagaagagcc tctccctgtc tccgggtaaa tga 993 



<210> 20 
<211> 330 
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<212> PRT 

<213> Homo sapiens 

<400> 20 

Ala Ser Thr Lys Gly Pro Ser Val Phe Pro Leu Ala Pro Ser Ser Lys 
15 10 15 

Ser Thr Ser Gly Gly Thr Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr 
20 25 30 

Phe Pro Glu Pro Val Thr Val Ser Trp Asn Ser Gly Ala Leu Thr Ser 
35 40 45 

Gly Val His Thr Phe Pro Ala Val Leu Gin Ser Ser Gly Leu Tyr Ser 
50 55 60 

Leu Ser Ser Val Val Thr Val Pro Ser Ser Ser Leu Gly Thr Gin Thr 
65 70 75 80 

Tyr lie Cys Asn Val Asn His Lys Pro Ser Asn Thr Lys Val Asp Lys 
85 90 95 

Lys Val Glu Pro Lys Ser Cys Asp Lys Thr His Thr Cys Pro Pro Cys 
100 105 110 

Pro Ala Pro Glu Leu Leu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro 
115 120 125 

Lys Pro Lys Asp Thr Leu Met lie Ser Arg Thr Pro Glu Val Thr Cys 
130 135 140 

Val Val Val Asp Val Ser His Glu Asp Pro Glu Val Lys Phe Asn Trp 
145 150 155 160 

Tyr Val Asp Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu 
165 170 175 

Glu Gin Tyr Asn Ser Thr Tyr Arg Val Val Ser Val Leu Thr Val Leu 
180 185 190 

His Gin Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn 
195 200 205 

Lys Ala Leu Pro Ala Pro lie Glu Lys Thr lie Ser Lys Ala Lys Gly 
210 215 220 

Gin Pro Arg Glu Pro Gin Val Tyr Thr Leu Pro Pro Ser Arg Asp Glu 
225 230 235 240 

Leu Thr Lys Asn Gin Val Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr 
245 250 255 

Pro Ser Asp lie Ala Val Glu Trp Glu Ser Asn Gly Gin Pro Glu Asn 
260 265 270 

Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp Ser Asp Gly Ser Phe Phe 
275 280 285 



Leu Tyr Ser Lys Leu Thr Val Asp Lys Ser Arg Trp Gin Gin Gly Asn 
290 295 300 
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Val Phe Ser Cys Ser Val Met His Glu Ala Leu His Asn His Tyr Thr 
305 310 315 320 

Gin Lys Ser Leu Ser Leu Ser Pro Gly Lys 
325 330 



<210> 21 

<211> 978 

<212> DNA 

<213> Homo sapiens 



<400> 21 

gcctccacca 

agcacagccg 

tggaactcag 

ggactctact 

tacacctgca 

aaatgttgtg 

ctcttccccc 

gtggtggtgg 

gtggaggtgc 

gtggtcagcg 

aaggtctcca 

cagccccgag 

caggtcagcc 

gagagcaatg 

ggctccttct 

gtcttctcat 

tccctgtctc 



agggcccatc 
ccctgggctg 
gcgctctgac 
ccctcagcag 
acgtagatca 
tcgagtgccc 
caaaacccaa 
acgtgagcca 
ataatgccaa 
tcctcaccgt 
acaaaggcct 
aaccacaggt 
tgacctgcct 
ggcagccgga 
tcctctacag 
gctccgtgat 
cgggtaaa 



ggtcttcccc 
cctggtcaag 
cagcggcgtg 
cgtggtgacc 
caagcccagc 
accgtgccca 
ggacaccctc 
cgaagacccc 
gacaaagcca 
tgtgcaccag 
cccagccccc 
gtacaccctg 
ggtcaaaggc 
gaacaactac 
caagctcacc 
gcatgaggct 



ctggcgccct 
gactacttcc 
cacaccttcc 
gtgccctcca 
aacaccaagg 
gcaccacctg 
atgatctccc 
gaggtccagt 
cgggaggagc 
gactggctga 
atcgagaaaa 
cccccatccc 
ttctacccca 
aagaccacac 
gtggacaaga 
ctgcacaacc 



gctccaggag 
ccgaaccggt 
cagctgtcct 
gcaacttcgg 
tggacaagac 
tggcaggacc 
ggacccctga 
tcaactggta 
agttcaacag 
acggcaagga 
ccatctccaa 
gggaggagat 
gcgacatcgc 
ctcccatgct 
gcaggtggca 
actacacgca 



cacctccgag 60 
gacggtgtcg 12 0 
acagtcctca 180 
cacccagacc 240 
agttgagcgc 3 00 
gtcagtcttc 360 
ggtcacgtgc 42 0 
cgtggacggc 4 80 
cacgttccgt 540 
gtacaagtgc 600 
aaccaaaggg 660 
gaccaagaac 720 
cgtggagtgg 780 
ggactccgac 840 
gcaggggaac 900 
gaagagcctc 960 
978 



<210> 22 

<211> 326 

<212> PRT 

<213> Homo sapiens 



<400> 22 

Ala Ser Thr Lys Gly 
1 5 



Pro Ser Val 



Phe Pro Leu 
10 



Ala Pro 



Cys Ser Arg 
15 



Ser Thr Ser Glu Ser 
20 



Thr Ala Ala 



Leu Gly Cys 
25 



Leu Val 



Lys Asp Tyr 
30 



Phe Pro Glu Pro Val Thr Val Ser 
35 40 



Trp Asn Ser 



Gly Ala 
45 



Leu Thr Ser 



Gly Val His Thr Phe Pro Ala 
50 55 



Val Leu Gin Ser 



Ser Gly 
60 



Leu Tyr Ser 



Leu Ser Ser Val Val Thr Val Pro Ser Ser Asn Phe Gly 
65 70 75 

Tyr Thr Cys Asn Val Asp His Lys Pro Ser Asn Thr Lys 
85 90 

Thr Val Glu Arg Lys Cys Cys Val Glu Cys Pro Pro Cys 
100 105 



Thr Gin Thr 
80 

Val Asp Lys 
95 

Pro Ala Pro 
110 



Pro Val Ala Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp 
115 120 125 
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Thr Leu Met lie Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp 
130 135 140 

Val Ser His Glu Asp Pro Glu Val Gin Phe Asn Trp Tyr Val Asp Gly 
145 150 155 160 

Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gin Phe Asn 
165 170 175 

Ser Thr Phe Arg Val Val Ser Val Leu Thr Val Val His Gin Asp Trp . 
180 185 190 

Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Gly Leu Pro 
195 200 205 

Ala Pro lie Glu Lys Thr lie Ser Lys Thr Lys Gly Gin Pro Arg Glu 
210 215 220 

Pro Gin Val Tyr Thr Leu Pro Pro Ser Arg Glu Glu Met Thr Lys Asn 
225 230 235 240 

Gin Val Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp lie 
245 250 255 

Ala Val Glu Trp Glu Ser Asn Gly Gin Pro Glu Asn Asn Tyr Lys Thr 
260 265 270 

Thr Pro Pro Met Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys 
275 280 285 

Leu Thr Val Asp Lys Ser Arg Trp Gin Gin Gly Asn Val Phe Ser Cys 
290 295 300 

Ser Val Met His Glu Ala Leu His Asn His Tyr Thr Gin Lys Ser Leu 
305 310 315 320 

Ser Leu Ser Pro Gly Lys 
325 



<210> 23 

<211> 1134 

<212> DNA 

<213> Homo sapiens 



<400> 23 

gcttccacca 

ggcacagcgg 

tggaactcag 

ggactctact 

tacacctgca 

aaaaccccac 

gacacacctc 

tgcccacggt 

gcacctgaac 

cttatgattt 

cccgaggtcc 

ccgcgggagg 

caggactggc 

cccatcgaga 



agggcccatc 
ccctgggctg 
gcgccctgac 
ccctcagcag 
acgtgaatca 
ttggtgacac 
ccccgtgccc 
gcccagagcc 
tcctgggagg 
cccggacccc 
agttcaagtg 
agcagtacaa 
tgaacggcaa 
aaaccatctc 



ggtcttcccc 
cctggtcaag 
cagcggcgtg 
cgtggtgacc 
caagcccagc 
aactcacaca 
acggtgccca 
caaatcttgt 
accgtcagtc 
tgaggtcacg 
gtacgtggac 
cagcacgttc 
ggagtacaag 
caaaaccaaa 



ctggcgccct 
gactacttcc 
cacaccttcc 
gtgccctcca 
aacaccaagg 
tgcccacggt 
gagcccaaat 
gacacacctc 
ttcctcttcc 
tgcgtggtgg 
ggcgtggagg 
cgtgtggtca 
tgcaaggtct 
ggacagcccc 



gctccaggag 
ccgaaccggt 
cggctgtcct 
gcagcttggg 
tggacaagag 
gcccagagcc 
cttgtgacac 
ccccgtgccc 
ccccaaaacc 
tggacgtgag 
tgcataatgc 
gcgtcctcac 
ccaacaaagc 
gagaaccaca 



cacctctggg 
gacggtgtcg 
acagtcctca 
cacccagacc 
agttgagctc 
caaatcttgt 
acctccccca 
aaggtgccca 
caaggatacc 
ccacgaagac 
caagacaaag 
cgtcctgcac 
cctcccagcc 
ggtgtacacc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 
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ctgcccccat cccgggagga gatgaccaag aaccaggtca gcctgacctg cctggtcaaa 900 

ggcttctacc ccagcgacat cgccgtggag tgggagagca gcgggcagcc ggagaacaac 960 

tacaacacca cgcctcccat gctggactcc gacggctcct tcttcctcta cagcaagctc 1020 

accgtggaca agagcaggtg gcagcagggg aacatcttct catgctccgt gatgcatgag 1080 

gctctgcaca accgcttcac gcagaagagc ctctccctgt ctccgggtaa atga 1134 



<210> 24 
<211> 377 
<212> PRT 

<213> Homo sapiens 
<400> 24 

Ala Ser Thr Lys Gly Pro Ser Val Phe Pro Leu Ala Pro Cys Ser Arg 
15 10 15 

Ser Thr Ser Gly Gly Thr Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr 
20 25 30 

Phe Pro Glu Pro Val Thr Val Ser Trp Asn Ser Gly Ala Leu Thr Ser 
35 40 45 

Gly Val His Thr Phe Pro Ala Val Leu Gin Ser Ser Gly Leu Tyr Ser 
50 55 60 

Leu Ser Ser Val Val Thr Val Pro Ser Ser Ser Leu Gly Thr Gin Thr 
65 70 75 80 

Tyr Thr Cys Asn Val Asn His Lys Pro Ser Asn Thr Lys Val Asp Lys 
85 90 95 

Arg Val Glu Leu Lys Thr Pro Leu Gly Asp Thr Thr His Thr Cys Pro 
100 105 110 

Arg Cys Pro Glu Pro Lys Ser Cys Asp Thr Pro Pro Pro Cys Pro Arg 
115 120 125 

Cys Pro Glu Pro Lys Ser Cys Asp Thr Pro Pro Pro Cys Pro Arg Cys 
130 135 140 

Pro Glu Pro Lys Ser Cys Asp Thr Pro Pro Pro Cys Pro Arg Cys Pro 
145 150 155 160 

Ala Pro Glu Leu Leu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys 
165 170 175 

Pro Lys Asp Thr Leu Met lie Ser Arg Thr Pro Glu Val Thr Cys Val 
180 185 190 

Val Val Asp Val Ser His Glu Asp Pro Glu Val Gin Phe Lys Trp Tyr 
195 200 205 

Val Asp Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu 
210 215 220 

Gin Tyr Asn Ser Thr Phe Arg Val Val Ser Val Leu Thr Val Leu His 
225 230 235 240 



Gin Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys 
245 250 255 
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Ala Leu Pro Ala Pro lie Glu Lys Thr lie Ser Lys Thr Lys Gly Gin 
260 265 270 

Pro Arg Glu Pro Gin Val Tyr Thr Leu Pro Pro Ser Arg Glu Glu Met 
275 280 285 

Thr Lys Asn Gin Val Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro 
290 295 300 

Ser Asp lie Ala Val Glu Trp Glu Ser Ser Gly Gin Pro Glu Asn Asn 
305 310 315 320 

Tyr Asn Thr Thr Pro Pro Met Leu Asp Ser Asp Gly Ser Phe Phe Leu 
325 330 335 

Tyr Ser Lys Leu Thr Val Asp Lys Ser Arg Trp Gin Gin Gly Asn lie 
340 345 350 

Phe Ser Cys Ser Val Met His Glu Ala Leu His Asn Arg Phe Thr Gin 
355 360 365 

Lys Ser Leu Ser Leu Ser Pro Gly Lys 
370 375 



<210> 25 

<211> 984 

<212> DNA 

<213> Homo sapiens 



<400> 25 

gcttccacca 

agcacagccg 

tggaactcag 

ggactctact 

tacacctgca 

aaatatggtc 

ttcctgttcc 

tgcgtggtgg 

ggcgtggagg 

cgtgtggtca 

tgcaaggtct 

gggcagcccc 

aaccaggtca 

tgggagagca 

gacggctcct 

aatgtcttct 

ctctccctgt 



agggcccatc 

ccctgggctg 
gcgccctgac 
ccctcagcag 
acgtagatca 
ccccatgccc 
ccccaaaacc 
tggacgtgag 
tgcataatgc 
gcgtcctcac 
ccaacaaagg 
gagagccaca 
gcctgacctg 
atgggcagcc 
tcttcctcta 
catgctccgt 
ctctgggtaa 



cgtcttcccc 
cctggtcaag 
cagcggcgtg 

cgtggtgacc 

caagcccagc 
atcatgccca 
caaggacact 
ccaggaagac 
caagacaaag 
cgtcctgcac 
cctcccgtcc 
ggtgtacacc 
cctggtcaaa 
ggagaacaac 
cagcaggcta 
gatgcatgag 
atga 



ctggcgccct 
gactacttcc 
cacaccttcc 
gtgccctcca 
aacaccaagg 
gcacctgagt 
ctcatgatct 
cccgaggtcc 
ccgcgggagg 
caggactggc 
tccatcgaga 
ctgcccccat 
ggcttctacc 
tacaagacca 
accgtggaca 
gctctgcaca 



gctccaggag 
ccgaaccggt 
cggctgtcct 
gcagcttggg 
tggacaagag 
tcctgggggg 
cccggacccc 
agttcaactg 
agcagttcaa 
tgaacggcaa 
aaaccatctc 
cccaggagga 
ccagcgacat 
cgcctcccgt 
agagcaggtg 
accactacac 



cacctccgag 
gacggtgtcg 
acagtcctca 
cacgaagacc 
agttgagtcc 
accatcagtc 
tgaggtcacg 
gtacgtggat 
cagcacgtac 
ggagtacaag 
caaagccaaa 
gatgaccaag 
cgccgtggag 
gctggactcc 
gcaggagggg 
acagaagagc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

984 



<210> 26 
<211> 327 
<212> PRT 

<213> Homo sapiens 
<400> 26 

Ala Ser Thr Lys Gly Pro Ser Val Phe Pro Leu Ala Pro Cys Ser Arg 
15 10 15 

Ser Thr Ser Glu Ser Thr Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr 
20 25 30 
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Phe Pro Glu Pro Val Thr Val Ser Trp Asn Ser Gly Ala Leu Thr Ser 
35 40 45 

Gly Val His Thr Phe Pro Ala Val Leu Gin Ser Ser Gly Leu Tyr Ser 
50 55 60 

Leu Ser Ser Val Val Thr Val Pro Ser Ser Ser Leu Gly Thr Lys Thr 
65 70 75 80 

Tyr Thr Cys Asn Val Asp His Lys Pro Ser Asn Thr Lys Val Asp Lys 
85 90 95 

Arg Val Glu Ser Lys Tyr Gly Pro Pro Cys Pro Ser Cys Pro Ala Pro 
100 105 110 

Glu Phe Leu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys 
115 120 125 

Asp Thr Leu Met lie Ser Arg Thr Pro Glu Val Thr Cys Val Val Val 
130 135 140 

Asp Val Ser Gin Glu Asp Pro Glu Val Gin Phe Asn Trp Tyr Val Asp 
145 150 155 160 

Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gin Phe 
165 170 175 

Asn Ser Thr Tyr Arg Val Val Ser Val Leu Thr Val Leu His Gin Asp 
180 185 190 

Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Gly Leu 
195 200 205 

Pro Ser Ser lie Glu Lys Thr lie Ser Lys Ala Lys Gly Gin Pro Arg 
210 215 220 

Glu Pro Gin Val Tyr Thr Leu Pro Pro Ser Gin Glu Glu Met Thr Lys 
225 230 235 240 

Asn Gin Val Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp 
245 250 255 

lie Ala Val Glu Trp Glu Ser Asn Gly Gin Pro Glu Asn Asn Tyr Lys 
260 265 270 

Thr Thr Pro Pro Val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser 
275 280 285 

Arg Leu Thr Val Asp Lys Ser Arg Trp Gin Glu Gly Asn Val Phe Ser 
290 295 300 

Cys Ser Val Met His Glu Ala Leu His Asn His Tyr Thr Gin Lys Ser 
305 310 315 320 

Leu Ser Leu Ser Leu Gly Lys 
325 



<210> 27 
<211> 300 
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<212> DNA 

<213> Homo sapiens 
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<400> 27 

taggctgcct 

tctgccctaa 

tcctccgtgc 

ccagccacca 

cctgctcatt 



gtgcccccca cctgcctgtc cacaacccag cctctggtac atccatgccc 60 
gcctcacctg cacttttcct tggatttcag agtctccaaa ggcacaggcc 12 0 
ccactgcaca accccaagca gagggcagcc tcgccaaggc aaccacagcc 180 
cccgtaacac aggtgagaag ccccttccct gcacactcca cccccaccca 240 
cctcagccgc ctcctccagg cagcccttca taactccttg tctgagtctc 300 



<210> 28 
<211> 383 
<212> PRT 

<213> Homo sapiens 
<400> 28 

Ala Pro Thr Lys Ala Pro Asp Val Phe Pro lie lie Ser Gly Cys Arg 
15 10 15 

His Pro Lys Asp Asn Ser Pro Val Val Leu Ala Cys Leu lie Thr Gly 
20 25 30 

Tyr His Pro Thr Ser Val Thr Val Thr Trp Tyr Met Gly Thr Gin Ser 
35 40 45 

Gin Pro Gin Arg Thr Phe Pro Glu lie Gin Arg Arg Asp Ser Tyr Tyr 
50 55 60 

Met Thr Ser Ser Gin Leu Ser Thr Pro Leu Gin Gin Trp Arg Gin Gly 
65 70 75 80 

Glu Tyr Lys Cys Val Val Gin His Thr Ala Ser Lys Ser Lys Lys Glu 
85 90 95 

lie Phe Arg Trp Pro Glu Ser Pro Lys Ala Gin Ala Ser Ser Val Pro 
100 105 110 

Thr Ala Gin Pro Gin Ala Glu Gly Ser Leu Ala Lys Ala Thr Thr Ala 
115 120 125 

Pro Ala Thr Thr Arg Asn Thr Gly Arg Gly Gly Glu Glu Lys Lys Lys 
130 135 140 

Glu Lys Glu Lys Glu Glu Gin Glu Glu Arg Glu Thr Lys Thr Pro Glu 
145 150 155 160 

Cys Pro Ser His Thr Gin Pro Leu Gly Val Tyr Leu Leu Thr Pro Ala 
165 170 175 

Val Gin Asp Leu Trp Leu Arg Asp Lys Ala Thr Phe Thr Cys Phe Val 
180 185 190 

Val Gly Ser Asp Leu Lys Asp Ala His Leu Thr Trp Glu Val Ala Gly 
195 200 205 

Lys Val Pro Thr Gly Gly Val Glu Glu Gly Leu Leu Glu Arg His Ser 
210 215 220 



Asn Gly Ser Gin Ser Gin His Ser Arg Leu Thr Leu Pro Arg Ser Leu 
225 230 235 240 
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Trp Asn Ala Gly 



Pro Pro Gin Arg 
260 

Val Lys Leu Ser 
275 

Ala Ser Trp Leu 
290 

Leu Leu Met Trp 
305 

Ala Pro Ala Arg 



Trp Ser Val Leu 
340 

Tyr Thr Cys Val 
355 

Ser Arg Ser Leu 
370 



Thr Ser Val Thr 
245 

Leu Met Ala Leu 



Leu Asn Leu Leu 
280 

Leu Cys Glu Val 
295 

Leu Glu Asp Gin 
310 

Pro Pro Pro Gin 
325 

Arg Val Pro Ala 



Val Ser His Glu 
360 

Glu Val Ser Tyr 
375 



Cys Thr Leu Asn 
250 

Arg Glu Pro Ala 
265 

Ala Ser Ser Asp 



Ser Gly Phe Ser 
300 

Arg Glu Val Asn 
315 

Pro Gly Ser Thr 
330 

Pro Pro Ser Pro 
345 

Asp Ser Arg Thr 



Val Thr Asp His 
380 



His Pro Ser Leu 
255 

Ala Gin Ala Pro 
270 

Pro Pro Glu Ala 
285 

Pro Pro Asn lie 



Thr Ser Gly Phe 
320 

Thr Phe Trp Ala 
335 

Gin Pro Ala Thr 
350 

Leu Leu Asn Ala 
365 

Gly Pro Met 



<210> 29 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 29 

gtcattagct ggatttagcc attccacaat 
tgataaacat gtataatttt tgtcaattaa 
aagaaggaga aggagaaaga ggaacaagaa 
tgacccctgg gctatcaaaa cacctattgc 
actattgaag cgattgctgt tatgtggatg 



gtacacatat ttcaaacatt gtgttgtata 60 
aaatttttag gaagaggagg agaagagaag 12 0 
gagagagaga caaagacacc aggttttttc 180 
ccaataacta gttggccgtt ggtgccctaa 240 
ggccccggac acttagaaac tcgtgacccc 3 00 



<210> 30 

<211> 429 

<212> PRT 

<213> Homo sapiens 

<400> 30 

Pro Thr Lys Ala Pro Asp Val Phe Pro lie lie Ser Gly Cys Arg His 
1 5 10 15 

Pro Lys Asp Asn Ser Pro Val Val Leu Ala Cys Leu lie Thr Gly Tyr 
20 25 30 

His Pro Thr Ser Val Thr Val Thr Trp Tyr Met Gly Thr Gin Ser Gin 
35 40 45 

Pro Gin Arg Thr Phe Pro Glu lie Gin Arg Arg Asp Ser Tyr Tyr Met 
50 55 60 



Thr Ser Ser Gin Leu Ser Thr Pro Leu Gin Gin Trp Arg Gin Gly Glu 



WO 03/064992 PCT/US02/34197 

27/119 

65 70 75 80 

Tyr Lys Cys Val Val Gin His Thr Ala Ser Lys Ser Lys Lys Glu lie 
85 90 95 

Phe Arg Trp Pro Glu Ser Pro Lys Ala Gin Ala Ser Ser Val Pro Thr 
100 105 110 

Ala Gin Pro Gin Ala Glu Gly Ser Leu Ala Lys Ala Thr Thr Ala Pro 
115 120 125 

Ala Thr Thr Arg Asn Thr Gly Arg Gly Gly Glu Glu Lys Lys Lys Glu 
130 135 140 

Lys Glu Lys Glu Glu Gin Glu Glu Arg Glu Thr Lys Thr Pro Glu Cys 
145 150 155 160 

Pro Ser His Thr Gin Pro Leu Gly Val Tyr Leu Leu Thr Pro Ala Val 
165 170 175 

Gin Asp Leu Trp Leu Arg Asp Lys Ala Thr Phe Thr Cys Phe Val Val 
180 185 190 

Gly Ser Asp Leu Lys Asp Ala His Leu Thr Trp Glu Val Ala Gly Lys 
195 200 205 

Val Pro Thr Gly Gly Val Glu Glu Gly Leu Leu Glu Arg His Ser Asn 
210 215 220 

Gly Ser Gin Ser Gin His Ser Arg Leu Thr Leu Pro Arg Ser Leu Trp 
225 230 235 240 

Asn Ala Gly Thr Ser Val Thr Cys Thr Leu Asn His Pro Ser Leu Pro 
245 250 255 

Pro Gin Arg Leu Met Ala Leu Arg Glu Pro Ala Ala Gin Ala Pro Val 
260 265 270 

Lys Leu Ser Leu Asn Leu Leu Ala Ser Ser Asp Pro Pro Glu Ala Ala 
275 280 285 

Ser Trp Leu Leu Cys Glu Val Ser Gly Phe Ser Pro Pro Asn lie Leu 
290 295 300 

Leu Met Trp Leu Glu Asp Gin Arg Glu Val Asn Thr Ser Gly Phe Ala 
305 310 315 320 

Pro Ala Arg Pro Pro Pro Gin Pro Arg Ser Thr Thr Phe Trp Ala Trp 
325 330 335 

Ser Val Leu Arg Val Pro Ala Pro Pro Ser Pro Gin Pro Ala Thr Tyr 
340 345 350 

Thr Cys Val Val Ser His Glu Asp Ser Arg Thr Leu Leu Asn Ala Ser 
355 360 365 

Arg Ser Leu Glu Val Ser Tyr Leu Ala Met Thr Pro Leu lie Pro Gin 
370 375 380 

Ser Lys Asp Glu Asn Ser Asp Asp Tyr Thr Thr Phe Asp Asp Val Gly 
385 390 395 400 
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Ser Leu Trp Thr Thr Leu Ser Thr Phe Val Ala Leu Phe lie Leu Thr 
405 410 415 

Leu Leu Tyr Ser Gly lie Val Thr Phe lie Lys Val Lys 
420 425 



<210> 31 

<211> 500 

<212> DNA 

<213> Homo sapiens 

<400> 31 

gaagctgggg agaggagagc acagtggtta agtcagtccc tgcagcccaa ctgctcccga 6 0 
aggtccggcc acagctgctc tcgtttgctc tcccctgcag agtgtccgag ccacacccag 120 
cctcttggcg tctacctgct aacccctgca gtgcaggacc tgtggctccg ggacaaagcc 180 
accttcacct gcttcgtggt gggcagtgac ctgaaggatg ctcacctgac ctgggaggtg 24 0 
gctgggaagg tccccacagg gggcgtggag gaagggctgc tggagcggca cagcaacggc 3 00 
tcccagagcc agcacagccg tctgaccctg cccaggtcct tgtggaacgc ggggacctcc 3 60 
gtcacctgca cactgaacca tcccagcctc ccaccccaga ggttgatggc gctgagagaa 420 
cccggtgagc ctggctccca ggtggggaga cgagggtgcc cacagcctgc tgacccctac 4 80 
gcccgcccca gggccatgac 5 00 

<210> 32 

<211> 383 

<212> PRT 

<213> Homo sapiens 

<400> 32 

Pro Thr Lys Ala Pro Asp Val Phe Pro lie lie Ser Gly Cys Arg His 
15 10 15 

Pro Lys Asp Asn Ser Pro Val Val Leu Ala Cys Leu lie Thr Gly Tyr 
20 25 30 

His Pro Thr Ser Val Thr Val Thr Trp Tyr Met Gly Thr Gin Ser Gin 
35 40 45 

Pro Gin Arg Thr Phe Pro Glu lie Gin Arg Arg Asp Ser Tyr Tyr Met 
50 55 60 

Thr Ser Ser Gin Leu Ser Thr Pro Leu Gin Gin Trp Arg Gin Gly Glu 
65 70 75 80 

Tyr Lys Cys Val Val Gin His Thr Ala Ser Lys Ser Lys Lys Glu lie 
85 90 95 

Phe Arg Trp Pro Glu Ser Pro Lys Ala Gin Ala Ser Ser Val Pro Thr 
100 105 110 

Ala Gin Pro Gin Ala Glu Gly Ser Leu Ala Lys Ala Thr Thr Ala Pro 
115 120 125 

Ala Thr Thr Arg Asn Thr Gly Arg Gly Gly Glu Glu Lys Lys Lys Glu 
130 135 140 

Lys Glu Lys Glu Glu Gin Glu Glu Arg Glu Thr Lys Thr Pro Glu Cys 
145 150 155 160 
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Pro Ser His Thr Gin Pro Leu Gly Val Tyr Leu Leu Thr Pro Ala Val 
165 170 175 

Gin Asp Leu Trp Leu Arg Asp Lys Ala Thr Phe Thr Cys Phe Val Val 
180 185 190 

Gly Ser Asp Leu Lys Asp Ala His Leu Thr Trp Glu Val Ala Gly Lys 
195 200 205 

Val Pro Thr Gly Gly Val Glu Glu Gly Leu Leu Glu Arg His Ser Asn 
210 215 220 

Gly Ser Gin Ser Gin His Ser Arg Leu Thr Leu Pro Arg Ser Leu Trp 
225 230 235 240 

Asn Ala Gly Thr Ser Val Thr Cys Thr Leu Asn His Pro Ser Leu Pro 
245 250 255 

Pro Gin Arg Leu Met Ala Leu Arg Glu Pro Ala Ala Gin Ala Pro Val 
260 265 270 

Lys Leu Ser Leu Asn Leu Leu Ala Ser Ser Asp Pro Pro Glu Ala Ala 
275 280 285 

Ser Trp Leu Leu Cys Glu Val Ser Gly Phe Ser Pro Pro Asn lie Leu 
290 295 300 

Leu Met Trp Leu Glu Asp Gin Arg Glu Val Asn Thr Ser Gly Phe Ala 
305 310 315 320 

Pro Ala Arg Pro Pro Pro Gin Pro Arg Ser Thr Thr Phe Trp Ala Trp 
325 330 335 

Ser Val Leu Arg Val Pro Ala Pro Pro Ser Pro Gin Pro Ala Thr Tyr 
340 345 350 

Thr Cys Val Val Ser His Glu Asp Ser Arg Thr Leu Leu Asn Ala Ser 
355 360 365 

Arg Ser Leu Glu Val Ser Tyr Val Thr Asp His Gly Pro Met Lys 
370 375 380 



<210> 33 
<211> 500 
<212> DNA 

<213> Homo sapiens 
<400> 33 

ccacaggaaa ggagaaggga 
cccgtggcca gagcctgaca 
ctttctctga acctgctggc 
gaggtgtctg gcttctcgcc 
gtgaacactt ctgggtttgc 
tgggcctgga gtgtgctgcg 
tgtgtggtca gccacgagga 
agctgtgagt cacccccagg 
ctggaatcca tactaggcag 



ggcaccacac cctggccggc 
gcccccccac ctccccgcag 
ctcgtctgac cctcccgagg 
ccccaacatc ctcctgatgt 
ccccgcacgc ccccctccac 
tgtcccagcc ccgcccagcc 
ctcccggact ctgctcaacg 
ccagggttgg gacggggact 



cccacttctc tcccagtgcc 60 
ctgcgcaggc acccgtcaag 120 
cggcctcgtg gctcctgtgt 180 
ggctggagga ccagcgtgag 24 0 
agcccaggag caccacgttc 300 
ctcagccagc cacctacacg 360 
ccagccggag cctagaagtc 42 0 
ctgagggggg ccataaggag 4 80 

500 



<210> 34 
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<400> 34 
000 

<210> 35 
<211> 26 
<212> PRT 

<213> Homo sapiens 
<400> 35 

Pro Thr Lys Ala Pro Asp Val Phe Pro lie lie Ser Gly Cys Arg His 
15 10 15 

Pro Lys Asp Asn Ser Pro Val Val Leu Ala 
20 25 



<210> 36 

<211> 100 

<212> DNA 

<213> Homo sapiens 



<400> 36 

gacacgccga ttttttgtta ttagatgtaa cagaccatgg ccccatgaaa tgatcccgga 60 
ccagatccgt ccgcacccgc cactcagcag ctctggccga 100 



<210> 37 

<400> 37 
000 



<210> 38 

<211> 200 

<212> DNA 

<213> Homo sapiens 



<400> 38 

cgctcggccc ccgttcctcc ccagacctgg ccatgacccc cctgatccct cagagcaagg 60 
atgagaacag cgatgactac acgacctttg atgatgtggg cagcctgtgg accaccctgt 12 0 
ccacgtttgt ggccctcttc atcctcaccc tcctctacag cggcattgtc actttcatca 180 
aggtcagggg agcggccagg 2 00 



<210> 39 

<400> 39 
000 



<210> 40 

<211> 100 

<212> DNA 

<213> Homo sapiens 



<400> 40 

tcaggcttct agcccctgtc tgaccccagg ggctgtcttt caggtgaagt agccccagaa 6 0 
gagcaggacg ccctgtacct gcagagaagg gaagcagcct 100 



WO 03/064992 
<210> 41 



31/119 



PCT/US02/34197 



<400> 41 
000 



<210> 42 

<211> 495 

<212> DNA 

<213> Homo sapiens 



<400> 42 

tttccctgcc 

aggtttccag 

gtgttcccca 

tgcttgataa 

agccagcccc 

agccagctct 

cacaccgcca 

tcacccagaa 

cagaagcaag 



tcccgtcacc 
cctcagactc 
tcatatcagg 
ctgggtacca 
agagaacctt 
ccacccccct 
gcaagagtaa 
gggcccccca 
ggctg 



ctgccgccag 
ccactgtgtc 
gtgcagacac 
cccaacgtcc 
ccctgagata 
ccagcagtgg 
gaaggagatc 
ggacccccag 



ggcctctgcc 
tgtcttccag 
ccaaaggata 
gtgactgtca 
caaagacggg 
cgccaaggcg 
ttccgctggc 
caccttccac 



ctgccctgcc 
cacccaccaa 
acagccctgt 
cctggtacat 
acagctacta 
agtacaaatg 
caggtaggtc 
tcagggcctg 



ccttgtcctc 
ggctccggat 
ggtcctggca 
ggggacacag 
catgacaagc 
cgtggtccag 
gcaccggaga 
accacaaaga 



60 

120 

180 

240 

300 

360 

420 

480 

495 



<210> 43 

<400> 43 
000 



<210> 44 

<211> 1920 

<212> DNA 

<213> Homo sapiens 



<400> 44 

ggatccctgc 

gcctgaggct 

ttccccttga 

tgcctggcca 

aacgggacaa 

atcagcttgc 

cacactccat 

agctcagaga 

tgagggggtg 

acgtgggggc 

agctctcccc 

tcctgcgacg 

tacaccccag 

ttgtccaccg 

ctcagccaga 

cacacctttg 

gccacggagg 

attccaaccc 

tccgcaagtc 

tgaacctgac 

agaagcagcg 

ggatcgaggg 

tgcggtccac 

agggagcgag 

cccggaagtc 

cgcctgcctg 



cacggggtcc 
ggcactgact 
cccgctgctg 
cgggctactt 
ctatgacctt 
tgaccgtctc 
cgtccacaga 
ccacagttcc 
ggtgggctca 
aagaggaggg 
cagtctgctc 
gcggcgggca 
ggactatcaa 
cctctaccac 
agcactggct 
aggacagcac 
ccagagaaga 
gagaggggtg 
gcccacgatc 
ctggtcccgg 
caatggcacg 
ggagacctac 
gaccaagacc 
tgagcggggc 
tatgcgtttg 
atccagaact 



ccagctcccc 
aggttctgtc 
caaaaacatt 
cccggagccg 
accagccacc 
gggtgcgtgg 
ctgggtcgac 
caggagtgcc 
aacgtgggaa 
cacacagagc 
cagggacttc 
cttccccccg 
catcacctgg 
gcaggagggt 
gtcagaccgc 
caagaagtgt 
ggggcgggtg 
agcgcctacc 
acctgtctgg 
gccagtggga 
ttaaccgtca 
cagtgcaggg 
agcggtgagc 
ccgggctgac 
cgacgccgga 
tcatgcctga 



catccaggcc 
ctcacagcct 
ccctccaatg 
gtgatggtga 
accctcacgc 
gccaagcaga 
aacaaaacct 
aggctgaggg 
cacccagcat 
tcagagaggc 
accccgccca 
accatccagc 
ctggaggacg 
gagctggcct 
acctacacct 
gcaggtacgt 
ggcctcacac 
taagccggcc 
tggtggacct 
agcctgtgaa 
cgtccaccct 
tgacccaccc 
catgggcagg 
cccacgtctg 
gtggccgggg 
ggacatctcg 



ccccaggctg 
ccacacagag 
ccacctccgt 
cctgggacac 
tctctggtca 
tgttcacctg 
tcagcggtaa 
ctggcagagt 
gcctggggac 
caacaaccct 
ccgtgaagat 
tcctgtgcct 
ggcaggtcat 
ccacacaaag 
gccaggtcac 
tcccacctgc 
agccctccgg 
cagcccgttc 
ggcacccagc 
ccactccacc 
gccggtgggc 
ccacctgccc 
ccggggtcgt 
gccacaggcc 
agccgggaca 
gtgcagtggc 



atgggcgctg 
cccatccgtc 
gactctgggc 
aggctccctc 
ctatgccacc 
ccgtgtggca 
gagagggcca 
gggcaggggt 
ccgggccagg 
catgaccacc 
cttacagtcg 
cgtctctggg 
ggacgtggac 
cgagctcacc 
ctatcaaggt 
cctggtggcc 
tgtaccacag 
gacctgttca 
aaggggaccg 
agaaaggagg 
acccgagact 
agggccctca 
gggggaaggg 
cgcgtgctgc 
agcgcaccct 
tgcacaacga 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 



WO 03/064992 PCT/US02/34197 

32/119 

ggtgcagctc ccggacgccc ggcacagcac gacgcagccc cgcaagacca agggctccgg 162 0 

cttcttcgtc ttcagccgcc tggaggtgac cagggccgaa tgggagcaga aagatgagtt 168 0 

catctgccgt gcagtccatg aggcagcgag cccctcacag accgtccagc gagcggtgtc 174 0 

tgtaaatccc ggtaaatgac gtactcctgc ctccctccct cccagggctc catccagctg 1800 

tgcagtgggg aggactggcc agaccttctg tccactgttg caatgacccc aggaagctac 1860 

ccccaataaa ctgtgcctgc tcagagcccc agtacaccca ttcttgggag cgggcagggc 192 0 



<210> 45 
<211> 574 
<212> PRT 

<213> Homo sapiens 
<400> 45 

Met Asp Trp Thr Trp lie Leu Phe Leu Val Ala Ala Ala Thr Arg Val 
1 5 10 15 

His Ser Gin Thr Gin Leu Val Gin Ser Gly Ala Glu Val Arg Lys Pro 
20 25 30 

Gly Ala Ser Val Arg Val Ser Cys Lys Ala Ser Gly Tyr Thr Phe lie 
35 40 45 

Asp Ser Tyr lie His Trp lie Arg Gin Ala Pro Gly His Gly Leu Glu 
50 55 60 

Trp Val Gly Trp lie Asn Pro Asn Ser Gly Gly Thr Asn Tyr Ala Pro 
65 70 75 80 

Arg Phe Gin Gly Arg Val Thr Met Thr Arg Asp Ala Ser Phe Ser Thr 
85 90 95 

Ala Tyr Met Asp Leu Arg Ser Leu Arg Ser Asp Asp Ser Ala Val Phe 
100 105 110 

Tyr Cys Ala Lys Ser Asp Pro Phe Trp Ser Asp Tyr Tyr Asn Phe Asp 
115 120 125 

Tyr Ser Tyr Thr Leu Asp Val Trp Gly Gin Gly Thr Thr Val Thr Val 
130 135 140 

Ser Ser Ala Ser Thr Gin Ser Pro Ser Val Phe Pro Leu Thr Arg Cys 
145 150 155 160 

Cys Lys Asn lie Pro Ser Asn Ala Thr Ser Val Thr Leu Gly Cys Leu 
165 170 175 

Ala Thr Gly Tyr Phe Pro Glu Pro Val Met Val Thr Trp Asp Thr Gly 
180 185 190 

Ser Leu Asn Gly Thr Thr Met Thr Leu Pro Ala Thr Thr Leu Thr Leu 
195 200 205 

Ser Gly His Tyr Ala Thr lie Ser Leu Leu Thr Val Ser Gly Ala Trp 
210 215 220 

Ala Lys Gin Met Phe Thr Cys Arg Val Ala His Thr Pro Ser Ser Thr 
225 230 235 240 



Asp Trp Val Asp Asn Lys Thr Phe Ser Val Cys Ser Arg Asp Phe Thr 
245 250 255 
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Pro Pro Thr Val Lys lie Leu Gin Ser Ser Cys Asp Gly Gly Gly His 
260 265 270 

Phe Pro Pro Thr lie Gin Leu Leu Cys Leu Val Ser Gly Tyr Thr Pro 
275 280 285 

Gly Thr lie Asn lie Thr Trp Leu Glu Asp Gly Gin Val Met Asp Val 
290 295 300 

Asp Leu Ser Thr Ala Ser Thr Thr Gin Glu Gly Glu Leu Ala Ser Thr 
305 310 315 320 

Gin Ser Glu Leu Thr Leu Ser Gin Lys His Trp Leu Ser Asp Arg Thr 
325 330 335 

Tyr Thr Cys Gin Val Thr Tyr Gin Gly His Thr Phe Glu Asp Ser Thr 
340 345 350 

Lys Lys Cys Ala Asp Ser Asn Pro Arg Gly Val Ser Ala Tyr Leu Ser 
355 360 365 

Arg Pro Ser Pro Phe Asp Leu Phe lie Arg Lys Ser Pro Thr lie Thr 
370 375 380 

Cys Leu Val Val Asp Leu Ala Pro Ser Lys Gly Thr Val Asn Leu Thr 
385 390 395 400 

Trp Ser Arg Ala Ser Gly Lys Pro Val Asn His Ser Thr Arg Lys Glu 
405 410 415 

Glu Lys Gin Arg Asn Gly Thr Leu Thr Val Thr Ser Thr Leu Pro Val 
420 425 430 

Gly Thr Arg Asp Trp lie Glu Gly Glu Thr Tyr Gin Cys Arg Val Thr 
435 440 445 

His Pro His Leu Pro Arg Ala Leu Met Arg Ser Thr Thr Lys Thr Ser 
450 455 460 

Gly Pro Arg Ala Ala Pro Glu Val Tyr Ala Phe Ala Thr Pro Glu Trp 
465 470 475 480 

Pro Gly Ser Arg Asp Lys Arg Thr Leu Ala Cys Leu lie Gin Asn Phe 
485 490 495 

Met Pro Glu Asp lie Ser Val Gin Trp Leu His Asn Glu Val Gin Leu 
500 505 510 

Pro Asp Ala Arg His Ser Thr Thr Gin Pro Arg Lys Thr Lys Gly Ser 
515 520 525 

Gly Phe Phe Val Phe Ser Arg Leu Glu Val Thr Arg Ala Glu Trp Glu 
530 535 540 

Gin Lys Asp Glu Phe lie Cys Arg Ala Val His Glu Ala Ala Ser Pro 
545 550 555 560 

Ser Gin Thr Val Gin Arg Ala Val Ser Val Asn Pro Gly Lys 
565 570 
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<210> 46 

<211> 2213 

<212> DNA 

<213> Homo sapiens 



<400> 46 

gctctagaac 

cagctcatca 

gtccagtccc 

gtgaaggtct 

cgacaggccc 

gcaaactacg 

acagcctaca 

aaaaccggga 

tacggtatgg 

gccccaaccc 

gccgttggct 

aagaacaact 

tacgcagcca 

cacgtggtgt 

gtgattgctg 

ggcaaccccc 

attcaggtgt 

gtgcaggctg 

atcaaagaga 

ctgaccttcc 

gtcttcgcca 

tgcctggtca 

ggcgaagctg 

gccgtgggtg 

accgtgaccc 

gtggccctgc 

cgggagtcgg 

cagtggatgc 

cctgagcccc 

gaatggaaca 

gtcaccgaga 

tttgagaacc 

tacagtacca 

cagagagagg 

gttggcacgt 

tctcgaccgg 

tttgctgtgt 



tagtggatcc 
ccatggactg 
aggtgcagct 
cctgcaaggc 
ctggacaagg 
cacagaagtt 
tggagctgag 
tcctggggcc 
acgtctgggg 
ttttccccct 
gcctcgcaca 
ctgacatcag 
cctcacaggt 
gcaaagtcca 
agctgcctcc 
gcagcaagtc 
cctggctgcg 
aggccaaaga 
gcgactggct 
agcagaatgc 
tccccccatc 
cagacctgac 
tgaaaaccca 
aggccagcat 
acacagacct 
acaggcccga 
ccaccatcac 
agagggggca 
aggccccagg 
cgggggagac 

ggaccgtgga 
tgtgggccac 
ccgtcacctt 
aactcaaagg 
gtttctcttc 
caggggctgg 
atacgcttgt 



cccgggctgc 
gacctggagg 
ggtgcagtct 
ttctggaggc 
gcttgagtgg 
ccagggcaga 
cagcctgaga 
gtatagcagt 
ccaagggacc 
cgtctcctgt 
ggacttcctt 
cagcacccgg 
gctgctgcct 
gcaccccaac 
caaagtgagc 
caagctcatc 
cgaggggaag 
gtctgggccc 
cagccagagc 
gtcctccatg 
ctttgccagc 
cacctatgac 
caccaacatc 
ctgcgaggat 
gccctcgcca 
tgtctacttg 
gtgcctggtg 
gcccttgtcc 
ccggtacttc 
ctacacctgc 
caagtccacc 
cgcctccacc 
gttcaaggtg 
ggcgctgcct 
ccgcccggcc 
ctggcttgca 
tgccctgaaa 



aggaattctc 
ttcctctttg 
ggggctgagg 
accttcagca 
atgggaggga 
gtcacgatta 
tctgaggaca 
ggctggtacc 
acggtcaccg 
gagaattccc 
cccgactcca 
ggcttcccat 
tccaaggacg 
ggcaacaaag 
gtcttcgtcc 
tgccaggcca 
caggtggggt 
acgacctaca 
atgttcacct 
tgtgtccccg 
atcttcctca 
agcgtgacca 
tccgagagcc 
gactggaatt 
ctgaagcaga 
ctgccaccag 
acgggcttct 
ccggagaagt 
gcccacagca 
gtggtggccc 
gagggggagg 
ttcatcgtcc 
aaatgatccc 
ccgggtctgg 
tccagttgtg 
ggccacgagg 
taaatatgca 



taaagaagcc 
tggtggcagc 
tgaagaagcc 
gctatgctat 
tcatccctat 
ccgcggacga 
cggccgtgta 
cgaactcgga 
tctcctcagg 
cgtcggatac 
tcactttctc 
cagtcctgag 
tcatgcaggg 
aaaagaacgt 
caccccgcga 
cgggtttcag 
ctggcgtcac 
aggtgaccag 
gccgcgtgga 
atcaagacac 
ccaagtccac 
tctcctggac 
accccaatgc 
ccggggagag 
ccatctcccg 
cccgggagca 
ctcccgcgga 
atgtgaccag 
tcctgaccgt 
atgaggccct 
tgagcgccga 
tcttcctcct 
aacagaagaa 
ggtcctggcc 
tgctctcaca 
tgggctctac 
cattttatcc 



cctgggagca 
agctacaggt 
tgggtcctcg 
cagctgggtg 
ctttggtaca 
atccacgagc 
ttactgtgcg 
ctactactac 
gagtgcatcc 
gagcagcgtg 
ctggaaatac 

agggggcaag 

cacagacgaa 
gcctcttcca 
cggcttcttc 
tccccggcag 
cacggaccag 
cacactgacc 
tcacaggggc 
agccatccgg 
caagttgacc 
ccgccagaat 
cactttcagc 
gttcacgtgc 
gcccaagggg 
gctgaacctg 
cgtcttcgtg 
cgccccaatg 
gtccgaagag 
gcccaacagg 
cgaggagggc 
gagcctcttc 
catcggagac 
tgcgtggcct 
caggcttcct 
cccacactgc 
atg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2213 



<210> 47 
<211> 627 
<212> PRT 

<213> Homo sapiens 
<400> 47 

Met Asp Trp Thr Trp Arg Phe Leu Phe Val Val Ala Ala Ala Thr Gly 
15 10 15 

Val Gin Ser Gin Val Gin Leu Val Gin Ser Gly Ala Glu Val Lys Lys 
20 25 30 

Pro Gly Ser Ser Val Lys Val Ser Cys Lys Ala Ser Gly Gly Thr Phe 
35 40 45 
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Ser Ser Tyr Ala lie Ser Trp Val Arg Gin Ala Pro Gly Gin Gly Leu 
50 55 60 

Glu Trp Met Gly Gly lie lie Pro lie Phe Gly Thr Ala Asn Tyr Ala 
65 70 75 80 

Gin Lys Phe Gin Gly Arg Val Thr lie Thr Ala Asp Glu Ser Thr Ser 
85 ~ 90 95 

Thr Ala Tyr Met Glu Leu Ser Ser Leu Arg Ser Glu Asp Thr Ala Val 
100 105 110 

Tyr Tyr Cys Ala Lys Thr Gly He Leu Gly Pro Tyr Ser Ser Gly Trp 
115 120 125 

Tyr Pro Asn Ser Asp Tyr Tyr Tyr Tyr Gly Met Asp Val Trp Gly Gin 
130 135 140 

Gly Thr Thr Val Thr Val Ser Ser Gly Ser Ala Ser Ala Pro Thr Leu 
145 150 155 160 

Phe Pro Leu Val Ser Cys Glu Asn Ser Pro Ser Asp Thr Ser Ser Val 
165 170 175 

Ala Val Gly Cys Leu Ala Gin Asp Phe Leu Pro Asp Ser He Thr Phe 
180 185 190 

Ser Trp Lys Tyr Lys Asn Asn Ser Asp He Ser Ser Thr Arg Gly Phe 
195 200 205 

Pro Ser Val Leu Arg Gly Gly Lys Tyr Ala Ala Thr Ser Gin Val Leu 
210 215 220 

Leu Pro Ser Lys Asp Val Met Gin Gly Thr Asp Glu His Val Val Cys 
225 230 235 240 

Lys Val Gin His Pro Asn Gly Asn Lys Glu Lys Asn Val Pro Leu Pro 
245 250 255 

Val He Ala Glu Leu Pro Pro Lys Val Ser Val Phe Val Pro Pro Arg 
260 265 270 

Asp Gly Phe Phe Gly Asn Pro Arg Ser Lys Ser Lys Leu He Cys Gin 
275 280 285 

Ala Thr Gly Phe Ser Pro Arg Gin He Gin Val Ser Trp Leu Arg Glu 
290 295 300 

Gly Lys Gin Val Gly Ser Gly Val Thr Thr Asp Gin Val Gin Ala Glu 
305 310 315 320 

Ala Lys Glu Ser Gly Pro Thr Thr Tyr Lys Val Thr Ser Thr Leu Thr 
325 330 335 

He Lys Glu Ser Asp Trp Leu Ser Gin Ser Met Phe Thr Cys Arg Val 
340 345 350 

Asp His Arg Gly Leu Thr Phe Gin Gin Asn Ala Ser Ser Met Cys Val 
355 360 365 



Pro Asp Gin Asp Thr Ala He Arg Val Phe Ala He Pro Pro Ser Phe 
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370 375 380 

Ala Ser lie Phe Leu Thr Lys Ser Thr Lys Leu Thr Cys Leu Val Thr 
385 390 395 400 

Asp Leu Thr Thr Tyr Asp Ser Val Thr He Ser Trp Thr Arg Gin Asn 
405 410 415 

Gly Glu Ala Val Lys Thr His Thr Asn He Ser Glu Ser His Pro Asn 
420 425 430 

Ala Thr Phe Ser Ala Val Gly Glu Ala Ser He Cys Glu Asp Asp Trp 
435 440 445 

Asn Ser Gly Glu Arg Phe Thr Cys Thr Val Thr His Thr Asp Leu Pro 
450 455 460 

Ser Pro Leu Lys Gin Thr He Ser Arg Pro Lys Gly Val Ala Leu His 
465 470 475 480 

Arg Pro Asp Val Tyr Leu Leu Pro Pro Ala Arg Glu Gin Leu Asn Leu 
485 490 495 

Arg Glu Ser Ala Thr He Thr Cys Leu Val Thr Gly Phe Ser Pro Ala 
500 505 510 

Asp Val Phe Val Gin Trp Met Gin Arg Gly Gin Pro Leu Ser Pro Glu 
515 520 525 

Lys Tyr Val Thr Ser Ala Pro Met Pro Glu Pro Gin Ala Pro Gly Arg 
530 535 540 

Tyr Phe Ala His Ser He Leu Thr Val Ser Glu Glu Glu Trp Asn Thr 
545 550 555 560 

Gly Glu Thr Tyr Thr Cys Val Val Ala His Glu Ala Leu Pro Asn Arg 
565 570 575 

Val Thr Glu Arg Thr Val Asp Lys Ser Thr Glu Gly Glu Val Ser Ala 
580 585 590 

Asp Glu Glu Gly Phe Glu Asn Leu Trp Ala Thr Ala Ser Thr Phe He 
595 600 605 

Val Leu Phe Leu Leu Ser Leu Phe Tyr Ser Thr Thr Val Thr Leu Phe 
610 615 620 

Lys Val Lys 
625 



<210> 48 
<211> 822 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Protein 
encoded by plasmid pSSPICAMHuA2 



<400> 48 
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Met Gly Ser Lys 
1 

Phe Thr Ser Thr 
20 

lie Leu Pro Arg 
35 

Asp Gin Pro Lys 
50 

Leu Leu Leu Pro 
65 

Gin Glu Asp Ser 



Ser Thr Ala Lys 
100 

Glu Leu Ala Pro 
115 

Leu Arg Cys Gin 
130 

Val Leu Leu Arg 
145 

Glu Pro Ala Glu 



Gly Ala Asn Phe 
180 

Leu Glu Leu Phe 
195 

Val Leu Pro Ala 
210 

Val Asp Thr Gin 
225 

Val Ser Glu Ala 



Pro Thr Val Thr 
260 

Ser Val Thr Ala 
275 

lie Leu Gly Asn 
290 

Ser Phe Pro Ala 
305 



Pro Phe Leu Ser 
5 

Ser Leu Ala Gin 



Gly Gly Ser Val 
40 

Leu Leu Gly lie 
55 

Gly Asn Asn Arg 
70 

Gin Pro Met Cys 
85 

Thr Phe Leu Thr 



Leu Pro Ser Trp 
120 

Val Glu Gly Gly 
135 

Gly Glu Lys Glu 
150 

Val Thr Thr Thr 
165 

Ser Cys Arg Thr 



Glu Asn Thr Ser 
200 

Thr Pro Pro Gin 
215 

Gly Thr Val Val 
230 

Gin Val His Leu 
245 

Tyr Gly Asn Asp 



Glu Asp Glu Gly 
280 

Gin Ser Gin Glu 
295 

Pro Asn Val lie 
310 



37/119 

Leu Leu Ser Leu 
10 

Thr Ser Val Ser 
25 

Leu Val Thr Cys 



Glu Thr Pro Leu 
60 

Lys Val Tyr Glu 
75 

Tyr Ser Asn Cys 
90 

Val Tyr Trp Thr 
105 

Gin Pro Val Gly 



Ala Pro Arg Ala 
140 

Leu Lys Arg Glu 
155 

Val Leu Val Arg 
170 

Glu Leu Asp Leu 
185 

Ala Pro Tyr Gin 



Leu Val Ser Pro 
220 

Cys Ser Leu Asp 
235 

Ala Leu Gly Asp 
250 

Ser Phe Ser Ala 
265 

Thr Gin Arg Leu 



Thr Leu Gin Thr 
300 

Leu Thr Lys Pro 
315 



Ser Leu Leu Leu 
15 

Pro Ser Lys Val 
30 

Ser Thr Ser Cys 
45 

Pro Lys Lys Glu 



Leu Ser Asn Val 
80 

Pro Asp Gly Gin 
95 

Pro Glu Arg Val 
110 

Lys Asn Leu Thr 
125 

Asn Leu Thr Val 



Pro Ala Val Gly 
160 

Arg Asp His His 
175 

Arg Pro Gin Gly 
190 

Leu Gin Thr Phe 
205 

Arg Val Leu Glu 



Gly Leu Phe Pro 
240 

Gin Arg Leu Asn 
255 

Lys Ala Ser Val 
270 

Thr Cys Ala Val 
285 

Val Thr lie Tyr 



Glu Val Ser Glu 
320 



Gly Thr Glu Val Thr Val Lys Cys Glu Ala His Pro Arg Ala Lys Val 
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325 330 335 

Thr Leu Asn Gly Val Pro Ala Gin Pro Leu Gly Pro Arg Ala Gin Leu 
340 345 350 

Leu Leu Lys Ala Thr Pro Glu Asp Asn Gly Arg Ser Phe Ser Cys Ser 
355 360 365 

Ala Thr Leu Glu Val Ala Gly Gin Leu lie His Lys Asn Gin Thr Arg 
370 375 380 

Glu Leu Arg Val Leu Tyr Gly Pro Arg Leu Asp Glu Arg Asp Cys Pro 
385 390 395 400 

Gly Asn Trp Thr Trp Pro Glu Asn Ser Gin Gin Thr Pro Met Cys Gin 
405 410 415 

Ala Trp Gly Asn Pro Leu Pro Glu Leu Lys Cys Leu Lys Asp Gly Thr 
420 425 430 

Phe Pro Leu Pro lie Gly Glu Ser Val Thr Val Thr Arg Asp Leu Glu 
435 440 445 

Gly Thr Tyr Leu Cys Arg Ala Arg Ser Thr Gin Gly Glu Val Thr Arg 
450 455 460 

Glu Val Thr Val Asn Val Thr Ser Gly Ser Ser Ala Ser Pro Thr Ser 
465 470 475 480 

Pro Lys Val Phe Pro Leu Ser Leu Asp Ser Thr Pro Gin Asp Gly Asn 
485 490 495 

Val Val Val Ala Cys Leu Val Gin Gly Phe Phe Pro Gin Glu Pro Leu 
500 505 510 

Ser Val Thr Trp Ser Glu Ser Gly Gin Asn Val Thr Ala Arg Asn Phe 
515 520 525 

Pro Pro Ser Gin Asp Ala Ser Gly Asp Leu Tyr Thr Thr Ser Ser Gin 
530 535 540 

Leu Thr Leu Pro Ala Thr Gin Cys Pro Asp Gly Lys Ser Val Thr Cys 
545 550 555 560 

His Val Lys His Tyr Thr Asn Ser Ser Gin Asp Val Thr Val Pro Cys 
565 570 575 

Arg Val Pro Pro Pro Pro Pro Cys Cys His Pro Arg Leu Ser Leu His 
580 585 590 

Arg Pro Ala Leu Glu Asp Leu Leu Leu Gly Ser Glu Ala Asn Leu Thr 
595 600 605 

Cys Thr Leu Thr Gly Leu Arg Asp Ala Ser Gly Ala Thr Phe Thr Trp 
610 615 620 

Thr Pro Ser Ser Gly Lys Ser Ala Val Gin Gly Pro Pro Glu Arg Asp 
625 630 635 640 



Leu Cys Gly Cys Tyr Ser Val Ser Arg Val Leu Pro Gly Cys Ala Gin 
645 650 655 
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Pro Trp Asn His Gly Glu Thr Phe Thr Cys Thr Ala Ala His Pro Glu 
660 665 670 

Leu Lys Thr Pro Leu Thr Ala Asn lie Thr Lys Ser Gly Asn Thr Phe 
675 680 685 

Arg Pro Glu Val His Leu Leu Pro Pro Pro Ser Glu Glu Leu Ala Leu 
690 695 700 

Asn Glu Leu Val Thr Leu Thr Cys Leu Ala Arg Gly Phe Ser Pro Lys 
705 710 715 720 

Asp Val Leu Val Arg Trp Leu Gin Gly Ser Gin Glu Leu Pro Arg Glu 
725 730 735 

Lys Tyr Leu Thr Trp Ala Ser Arg Gin Glu Pro Ser Gin Gly Thr Thr 
740 745 750 

Thr Tyr Ala Val Thr Ser lie Leu Arg Val Ala Ala Glu Asp Trp Lys 
755 760 765 

Lys Gly Glu Thr Phe Ser Cys Met Val Gly His Glu Ala Leu Pro Leu 
770 775 780 

Ala Phe Thr Gin Lys Thr lie Asp Arg Leu Ala Gly Lys Pro Thr His 
785 790 795 800 

lie Asn Val Ser Val Val Met Ala Glu Ala Asp Gly Thr Cys Tyr Arg 
805 810 815 

Ser Glu Lys Asp Glu Leu 
820 



<210> 49 

<400> 49 
000 



<210> 50 

<211> 159 

<212> PRT 

<213> Homo sapiens 

<400> 50 

Met Glu Asn His Leu Leu Phe Trp Gly Val Leu Ala Val Phe lie Lys 
15 10 15 

Ala Val His Val Lys Ala Gin Glu Asp Glu Arg lie Val Leu Val Asp 
20 25 30 

Asn Lys Cys Lys Cys Ala Arg lie Thr Ser Arg lie lie Arg Ser Ser 
35 40 45 

Glu Asp Pro Asn Glu Asp lie Val Glu Arg Asn lie Arg lie lie Val 
50 55 60 



Pro Leu Asn Asn Arg Glu Asn lie Ser Asp Pro Thr Ser Pro Leu Arg 
65 70 75 80 
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Thr Arg Phe Val Tyr His Leu Ser Asp Leu Cys Lys Lys Cys Asp Pro 
85 90 95 

Thr Glu Val Glu Leu Asp Asn Gin lie Val Thr Ala Thr Gin Ser Asn 
100 105 110 

lie Cys Asp Glu Asp Ser Ala Thr Glu Thr Cys Tyr Thr Tyr Asp Arg 
115 120 125 

Asn Lys Cys Tyr Thr Ala Val Val Pro Leu Val Tyr Gly Gly Glu Thr 
130 135 140 

Lys Met Val Glu Thr Ala Leu Thr Pro Asp Ala Cys Tyr Pro Asp 
145 150 155 



<210> 51 
<211> 602 
<212> PRT 

<213> Homo sapiens 
<400> 51 

Met Val Leu Phe Val Leu Thr Cys Leu Leu Ala Val Phe Pro Ala lie 
15 10 15 

Ser Thr Lys Ser Pro lie Phe Gly Pro Glu Glu Val Asn Ser Val Glu 
20 25 30 

Gly Asn Ser Val Ser lie Thr Cys Tyr Tyr Pro Pro Thr Ser Val Asn 
35 40 45 

Arg Thr Arg Lys Tyr Trp Cys Arg Gin Gly Ala Arg Gly Gly Cys lie 
50 55 60 

Thr Leu lie Ser Ser Glu Gly Tyr Val Ser Ser Lys Tyr Ala Gly Arg 
65 70 75 80 

Ala Asn Leu Thr Asn Phe Pro Glu Asn Gly Thr Phe Val Val Asn lie 
85 90 95 

Ala Gin Leu Ser Gin Asp Asp Ser Gly Arg Tyr Lys Cys Gly Leu Gly 
100 105 110 

lie Asn Ser Arg Gly Leu Ser Phe Asp Val Ser Leu Glu Val Ser Gin 
115 120 125 

Gly Pro Gly Leu Leu Asn Asp Thr Lys Val Tyr Thr Val Asp Leu Gly 
130 135 140 

Arg Thr Val Thr lie Asn Cys Pro Phe Lys Thr Glu Asn Ala Gin Lys 
145 150 155 160 

Arg Lys Ser Leu Tyr Lys Gin lie Gly Leu Tyr Pro Val Leu Val lie 
165 170 175 

Asp Ser Ser Gly Tyr Val Asn Pro Asn Tyr Thr Gly Arg lie Arg Leu 
180 185 190 



Asp lie Gin Gly Thr Gly Gin Leu Leu Phe Ser Val Val He Asn Gin 
195 200 205 
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Leu Arg Leu Ser 
210 

Ser Asn Ser Asn 
225 

Pro Glu Leu Val 



Leu Gly Pro Glu 
260 

Ser Gly Glu Asn 
275 

Pro Ala Phe Glu 
290 

Ser Phe Ser Val 
305 

Tyr Leu Cys Gly 



Gin Ala Trp Gin 
340 

Pro Thr Val Val 
355 

Pro Tyr Asn Arg 
370 

Glu Gly Ala Gin 
385 

Trp Val Lys Ala 



Gly Asn Gly Thr 
420 

Ala Gly Phe Tyr 
435 

Thr Val Glu lie 
450 

Gly Asn Val Thr 
465 

Pro Cys Lys Phe 



Thr Gly Cys Gin 
500 

Phe Val Asn Cys 
515 



Asp Ala Gly Gin 
215 

Lys Lys Asn Ala 
230 

Tyr Glu Asp Leu 
245 

Val Ala Asn Val 



Cys Asp Val Val 
280 

Gly Arg lie Leu 
295 

Val He Thr Gly 
310 

Ala Ser Asp Gly 
325 

Leu Phe Val Asn 



Lys Gly Val Ala 
360 

Lys Glu Ser Lys 
375 

Asn Gly Arg Cys 
390 

Gin Tyr Glu Gly 
405 

Phe Thr Val He 



Trp Cys Leu Thr 
440 

Lys He He Glu 
455 

Ala Val Leu Gly 
470 

Ser Ser Tyr Glu 
485 

Ala Leu Pro Ser 



Asp Glu Asn Ser 
520 



Tyr Leu Cys Gin 
22 0 

Asp Leu Gin Val 
235 

Arg Gly Ser Val 
250 

Ala Lys Phe Leu 
265 

Val Asn Thr Leu 



Leu Asn Pro Gin 
300 

Leu Arg Lys Glu 
315 

Gin Leu Gin Glu 
330 

Glu Glu Ser Thr 
345 

Gly Ser Ser Val 



Ser He Lys Tyr 
380 

Pro Leu Leu Val 
395 

Arg Leu Ser Leu 
410 

Leu Asn Gin Leu 
425 

Asn Gly Asp Thr 



Gly Glu Pro Asn 
460 

Glu Thr Leu Lys 
475 

Lys Tyr Trp Cys 
490 

Gin Asp Glu Gly 
505 

Arg Leu Val Ser 



Ala Gly Asp Asp 



Leu Lys Pro Glu 
240 

Thr Phe Cys Ala 
255 

Cys Arg Gin Ser 
270 

Gly Lys Arg Ala 
285 

Asp Lys Asp Gly 



Asp Ala Gly Arg 
320 

Gly Ser Pro He 
335 

He Pro Arg Ser 
350 

Ala Val Leu Cys 
365 

Trp Cys Leu Trp 



Asp Ser Glu Gly 
400 

Leu Glu Glu Pro 
415 

Thr Ser Arg Asp 
430 

Leu Trp Arg Thr 
445 

Leu Lys Val Pro 



Val Pro Cys Phe 
480 

Lys Trp Asn Asn 
495 

Pro Ser Lys Ala 
510 

Leu Thr Leu Asn 
525 
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Leu Val Thr Arg 
530 

Gly Phe Tyr Gly 
545 

Lys Ala Ala Gly 



Pro Asp Glu Lys 
580 

Ala lie Gin Asp 
595 



Ala Asp Glu Gly 
535 

Glu Thr Ala Ala 
550 

Ser Arg Asp Val 
565 

Val Leu Asp Ser 



Pro Arg Leu Phe 
600 



42/119 

Trp Tyr Trp Cys 
540 

Val Tyr Val Ala 
555 

Ser Leu Ala Lys 
570 

Gly Phe Arg Glu 
585 

Ala Glu 



Gly Val Lys Gin 



Val Glu Glu Arg 
560 

Ala Asp Ala Ala 
575 

lie Glu Asn Lys 
590 



<210> 52 

<211> 2533 

<212> DNA 

<213> Homo sapiens 



<400> 52 

ggtccaactg 

tctgtgctgg 

acagtgcatt 

gcagcaccca 

aggagccact 

cacccagcca 

ccacacagtg 

gccaggatgt 

accccgtcgt 

actcacacca 

gggcgaggcg 

gttccctcaa 

tgccaccccc 

gcgaacctca 

acgccctcaa 

tacagcgtgt 

acttgcactg 

ggtgggtcca 

ggcctgcctc 
tgaggcaggc 
tcatcccctg 
gaggtccacc 
acgtgcctgg 
caggagctgc 
accaccacct 
gacaccttct 
atcgaccgct 
gacggcacct 
ccaagcagcc 
gggaaagggc 
ctggcacccc 
gtggtgcagg 
gatgggttct 
ttgtccctga 
caggctagcc 
tccagggccc 
tcaggacgtc 
acggggcacg 
9 c 9gaacgtt 



caggcctgtg 
gttcctccag 
ctgtgttcca 
gccagatggg 
cagtgtgacc 
ggatgcctcc 
cctagccggc 
gactgtgccc 
gccctgacac 
gagtggaccc 
ggggcatggg 
ctccacctac 
gactgtcact 
cgtgcacact 
gtgggaagag 
ccagtgtcct 
ctgcctaccc 
gaccctgctc 
ctccctccca 
tgaggggcca 
cccagccccc 
tgctgccgcc 
cacgcggctt 
cccgcgagaa 
tcgctgtgac 
cctgcatggt 
tggcgggtaa 
gctactgagc 
ccacgcttcc 
cagggcatgg 
catgagactt 
aggcctcgct 
gagagacgca 
gagagccgcc 
aagaaaggcg 
cactgggagg 
ccactgacgc 
tgggcacagg 
cacagcccag 



gtgcaggagc 
tatagaggag 
gcatccccga 
aacgtggtca 
tggagcgaaa 

ggggacctgt 

aagtccgtga 
tgcccaggtc 
tgcgcctgca 
cgggccgagc 
gaagtatgtg 
cccatctccc 
gcaccgaccg 
gaccggcctg 
cgctgttcaa 
gccgggctgt 
cgagtccaag 
ggggccctgc 
atcctgggct 
ggacacagcc 
tgacctggct 
gccgtcggag 
cagccccaag 
gtacctgact 
cagcatactg 
gggccacgag 
acccacccat 
cgcccgcctg 
atccggcgcc 
acagggaaga 
tccaccctgg 
ggtgtgagat 
catgccccgg 
tctggctgta 
caggtagtgg 
acaagctcca 
atgcaggaag 
tgcacactca 
acccagttcc 



tgtgtgacca 
aggcagtata 
ccagccccaa 
tcgcctgcct 
gcggacaggg 
acaccacgag 
catgccacgt 
agagggcagg 
cccgtgttcc 
cccaggaggt 
ctgaccagct 
tcaactccac 
gccctcgagg 
agagatgcct 
ggaccacctg 
gccgagccat 
accccgctaa 
tcagtgctct 
ccagtgctca 
cggggtgccc 
ctctaccctc 
gagctggccc 
gacgtgctgg 
tgggcatccc 
cgcgtggcag 
gccctgccgc 
gtcaatgtgt 
tccccacccc 
tgtctgtcca 
ataccccctg 
tgtgagtgtg 
cttaggtctg 
acagttctga 
gctgggaggg 
caggagcggc 

gg a gggcccc 

gggcacctcc 
cactcacata 
agaaaagcca 



tggggctgtc 
gaggagaggg 
ggtcttcccg 
ggtccagggc 
cgtgaccgcc 
cagccagctg 
gaagcactac 
ctggggagtg 
ccacagggag 
ggtggtggac 
caggccatct 
ctaccccatc 
acctgctctt 
caggtgtcac 
agcgtgacct 
ggaaccatgg 
ccgccaccct 
ggtttgcaaa 
tgccaagtac 
accagagcag 
caggaaacac 
tgaacgagct 
ttcgctggct 
ggcaggagcc 
ccgaggactg 
tggccttcac 
ctgttgtcat 
tgaataaact 
tcctcagggt 
ccctgagcct 
agttgtgagt 
ccaaggcagg 
gtgagcagtg 
aatagggagg 
gagggagtga 
accaccctag 
ccttaaccac 
tatgcctgag 

ggggagtccc 



accaggcctc 
ccgcgtcctc 
ctgagcctct 
ttcttccccc 
agaaacttcc 
accctgccgg 
acgaatccca 
gggcggggcc 
ccgccccttc 
aggccaggag 
ctccactcca 
tccctcatgc 
aggttcagaa 
cttcacctgg 
ctgtggctgc 
gaagaccttc 
ctcaaaatcc 
gcatattcct 
acagggaaac 
aggggctctc 
attccggccc 
ggtgacgctg 
gcaggggtca 
cagccagggc 
gaagaagggg 
acagaagacc 
ggcggaggtg 
ccatgctccc 
ctcagcactt 
cggggggccc 
gtgagagtgt 
cacagcccag 
gcatggccgt 
gtaaaaggag 
ggggctggac 
tgggtgggcc 
actgctctgt 
ccctgcagga 
ctcccaagcc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 
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cccaagctca gcctgctccc ctaggcccct ctggcttccc tgtgtttcca ctgtgcacag 2400 
atcaggcacc aactccacag acccctccca ggcagcccct gctccctgcc tggccaagtc 2460 
tcccatccct tcctaagccc aactaggacc caaagcatag acagggaggg gccacgtggg 252 0 
gtggcatcag aag 2533 



<210> 53 
<211> 2516 
<212> DNA 

<213> Homo sapiens 



<400> 53 

ggtccaaccg 

ctctgtgctg 

atggcatgag 

aaggtcttcc 

ctggtccagg 

aacgtgaccg 

agcagccagc 

gtgaagcact 

ggctggggag 

ccccacaggg 

gtggtggtgg 

ctcaggccat 

gctgcaccga 

actgaccggc 

gagcgctgtt 

cctgcctggc 

ccccgagttg 

ctcggggccc 

ccaatcctgg 

ccaggacaca 

ccctgacctg 

gccgccgtcg 

cttcagcccc 

gaagtacctg 

gaccagcata 

ggtgggccac 

taaacccacc 
agccgcccgc 
tccatccggc 
tggacaggga 
ctttccaccc 
gctggtgtga 
gcacatgccc 
gcctctggct 
gcgcaggtag 
aggacaagct 
cgcatgcagg 
acatgcacac 
cagacccagt 
cccccaggcc 
cagacccctc 
cccaactagg 



caggcccatg 
ggttcctcca 
ggccgcgtcc 
cgctgagcct 
gcttcttccc 
ccagaaactt 
tgaccctgcc 
acacgaatcc 
tggggcgggg 
agccgcccct 
acaggccagg 
ctctccactc 
ccggccctcg 
ctgagagatg 
caaggaccac 
tgtgcccagc 
aagaccccac 
tgctcagtgc 
gctccagtgc 
gcccagggtg 
gctctctacc 
gaggagctgg 
aaggatgtgc 
acttgggcat 
ctgcgcgtgg 
gaggccctgc 
catgtcaatg 
ctgtccccac 
gcctgtctgt 
agaatacccc 

tggtgtgagt 

gatcttaggt 
cggacagttc 
gtagctggga 
tggcaggagc 
ccaggagggc 
aaggggcacc 
tcacactcac 
tccagaaaag 
cctctggctt 
ccaggcagcc 
acccaaagca 



gtgcaggagc 
gtgtagagga 
tcacagcgca 
cgacagcacc 
ccaggagcca 
cccacctagc 
ggccacacag 
cagccaggat 
ccaccccgtc 
tcactcacac 
aggggcgagg 
cagttccccc 
aggacctgct 
cctctggtgc 
ctgagcgtga 
catggaacca 
taaccgccaa 
tctggtttgc 
tcatgccaag 
cccaccagag 
ctccaggaaa 
ccctgaacga 
tggttcgctg 
cccggcagga 
cagccgagga 
cgctggcctt 
tgtctgttgt 
ccctgaataa 
ccatcctcag 
ctgccctgag 
gtgagttgtg 
ctgccaaggc 
tgagtgagca 
gggaataggg 
ggcgagggag 
cccaccaccc 
tccccttaac 
atatacgcct 
ccaggggagt 
ccctgtgttt 
cctgctccct 
tagacaggga 



tgtgtaacct 
gaggcaggta 
ttctgtgttc 
ccccaagatg 
ctcagtgtga 
caggatgcct 
tgcccagacg 
gtgactgtgc 
ctgccctgac 
cagagtggac 

cgggggcacg 

acctccccca 
cttaggttca 
caccttcacc 
cctctgtggc 
tggggagacc 
catcacaaaa 
aaagcatatt 
tacacaggga 
cagaggggct 
cacattccgg 
gctggtgacg 
gctgcagggg 
gcccagccag 
ctggaagaag 
cacacagaag 
catggcggag 
actccatgct 
ggtctcagca 
cctcgggggg 
agtgtgagag 
aggcacagcc 
gtggcatggc 
agggtaaaag 
tgaggggctg 
tagtgggtgg 
cacactgctc 
gagccctgca 
cccctcccaa 
ccactgtgca 
gcctggccaa 
ggggccgcgt 



atggggctgt 
cagcctgtcc 
cagcatcccc 
ggaacgtggt 
cctggagcga 
ccggggacct 
gcaagtccgt 
cctgcccagg 
actgcgcctg 
cccgggccga 
gggaagggcg 
tgctgccacc 
gaagcgaacc 
tggacgccct 
tgctacagcg 
ttcacctgca 
tccggtgggt 
cccggcctgc 
aactgaggca 
ctctcatccc 
cccgaggtcc 
ctgacgtgcc 
tcacaggagc 
ggcaccacca 

ggggacacct 

accatcgacc 
gtggacggca 
cccccaagca 
cttgggaaag 
cccctggcac 
tgtgtggtgc 
caggatgggt 
cgtttgtccc 
gagcaggcta 
gactccaggg 
gcctcaggac 
tgtacggggc 
ggagtggaac 
gcccccaagc 
cagatcaggc 
gtctcccatc 
ggggtggcat 



caccaggcct 
tcctggggac 
gaccagcccc 
cgtcgcatgc 
aagcggacag 
gtacaccacg 
gacatgccac 
tcagagggca 
cacccgtgtt 
gccccaggag 
ttctgaccag 
cccgactgtc 
tcacgtgcac 
caagtgggaa 
tgtccagtgt 
ctgctgccca 
ccagaccctg 
ctcctccctc 
ggctgagggg 
ctgcccagcc 
acctgctgcc 
tggcacgtgg 
tgccccgcga 
ccttcgctgt 
tctcctgcat 
gcttggcggg 
cctgctactg 
gccccacgct 
ggccagggca 
ccccatgaga 
aggaggcctc 
tctgagagac 
tgagagagcc 
gccaagaaag 
ccccactggg 
gtcccactga 
acgtgggcac 
gttcacagcc 
tcagcctgct 
accaactcca 
ccttcctaag 
cagaag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

24*00 

2460 

2516 



<210> 54 

<211> 2009 

<212> DNA 

<213> Homo sapiens 
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<400> 54 

agctttctgg 

ggcaggtggc 

cctcgcggac 

gcggtcacat 

ggcaccctcc 

ctacttcccc 

caccttcccg 

gccctccagc 

caccaaggtg 

aagcaggctc 

caaggcaggc 

gagagggtct 

caggccctgc 

aggaccctgc 

gacaccttct 

cttgtgacaa 

ccagctcaag 

ggtgctgaca 

gtcttcctct 

acatgcgtgg 

gacggcgtgg 

taccgggtgg 

aagtgcaagg 

aaaggtggga 

tgccctgaga 

gtgtacaccc 

ctggtcaaag 

gagaacaact 

agcaagctca 

atgcatgagg 

tgagtgcgac 

gcacgtaccc 

gccctgggcc 

tgagtggcat 



ggcaggccag 
gccagcaggt 
agttaagaac 
ggcaccacct 
tccaagagca 
gaaccggtga 
gctgtcctac 
agcttgggca 
gacaagaaag 
agcgctcctg 
cccgtctgcc 
tctggctttt 
acacaaaggg 
ccctgaccta 
ctcctcccag 
aactcacaca 
gcgggacagg 
cgtccacctc 
tccccccaaa 
tggtggacgt 
aggtgcataa 
tcagcgtcct 
tctccaacaa 
cccgtggggt 
gtgaccgctg 
tgcccccatc 
gcttctatcc 
acaagaccac 
ccgtggacaa 
ctctgcacaa 
ggccggcaag 
cctgtacata 
cctgcgagac 
gagggaggca 



gcctgacctt 
gcacacccaa 
ccaggggcct 
ctcttgcagc 
cctctggggg 

cggtgtcgtg 

agtcctcagg 
cccagaccta 
ttggtgagag 
cctggacgca 
tcttcacccg 
tcccaggctc 
gcaggtgctg 
agcccacccc 
attccagtaa 
tgcccaccgt 
tgccctagag 
catctcttcc 
acccaaggac 
gagccacgaa 
tgccaagaca 
caccgtcctg 
agccctccca 
gcgagggcca 
taccaacctc 
ccgggatgag 
cagcgacatc 
gcctcccgtg 
gagcaggtgg 
ccactacacg 
ccccgctccc 
cttcccgggc 
tgtgatggtt 
gagcgggtc 



44/119 

ggctttgggg 
tgcccatgag 
ctgcgcctgg 
ctccaccaag 
cacagcggcc 
gaactcaggc 
actctactcc 
catctgcaac 
gccagcacag 
tcccggctat 
gagcctctgc 
tgggcaggca 
ggctcagacc 
aaaggccaaa 
ctcccaatct 
gcccaggtaa 
tagcctgcat 
tcagcacctg 
accctcatga 
gaccctgagg 
aagccgcggg 
caccaggact 
gcccccatcg 
catggacaga 
tgtcctacag 
ctgaccaaga 
gccgtggagt 
ctggactccg 
cagcagggga 
cagaagagcc 
cgggctctcg 
gcccagcatg 
ctttccacgg 



cagggagggg 
cccagacact 
gcccagctct 
ggcccatcgg 
ctgggctgcc 
gccctgacca 
ctcagcagcg 
gtgaatcaca 
ggagggaggg 
gcagccccag 
ccgccccact 
caggctaggt 
tgccaagagc 
ctctccactc 
tctctctgca 
gccagcccag 
ccagggacag 
aactcctggg 
tctcccggac 
tcaagttcaa 
aggagcagta 
ggctgaatgg 
agaaaaccat 
ggccggctcg 
ggcagccccg 
accaggtcag 
gggagagcaa 
acggctcctt 
acgtcttctc 
tctccctgtc 
cggtcgcacg 
gaaataaagc 
gtcaggccga 



gctaaggtga 
ggacgctgaa 
gtcccacacc 
tcttccccct 
tggtcaagga 
gcggcgtgca 
tggtgaccgt 
agcccagcaa 
tgtctgctgg 
tccagggcag 
catgctcagg 
gcccctaacc 
catatccggg 
cctcagctcg 
gagcccaaat 
gcctcgccct 
gccccagccg 
gggaccgtca 
ccctgaggtc 
ctggtacgtg 
caacagcacg 
caaggagtac 
ctccaaagcc 
gcccaccctc 
agaaccacag 
cctgacctgc 
tgggcagccg 
cttcctctac 
atgctccgtg 
tccgggtaaa 
aggatgcttg 
acccagcgct 
gtctgaggcc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2009 



<210> 55 

<211> 2009 

<212> DNA 

<213> Homo sapiens 



<400> 55 

agctttctgg 

ggcaggtggc 

ctggaccctc 

cacaccgcgg 

ccccctggcg 

caaggactac 

cgtgcacacc 

gaccgtgccc 

cagcaacacc 

tgctggaagc 

agggcagcaa 

tgctcaggga 

gcccctaccc 

catatccggg 

cctcagctcg 

gagcgcaaat 

tccagctcaa 

gggtgctgac 



ggcgagccgg 
gccagccagg 
gtggatagac 
tcacatggca 
ccctgctcca 
ttccccgaac 
ttcccagctg 
tccagcaact 
aaggtggaca 
caggctcagc 
ggcaggcccc 
gagggtcttc 
caggcccttc 
aggaccctgc 
gacaccttct 
gttgtgtcga 
ggcgggacag 
acgtccacct 



gcctgacttt 
tgcacaccca 
aagaaccgag 
ccacctctct 
ggagcacctc 
cggtgacggt 
tcctacagtc 
tcggcaccca 
agacagttgg 
cctcctgcct 
atctgtctcc 
tggctttttc 
acacacaggg 
ccctgaccta 
ctcctcccag 
gtgcccaccg 
gtgccctaga 
ccatctcttc 



ggctttgggg 
atgcccgtga 
gggcctctgc 
tgcagcctcc 
cgagagcaca 
gtcgtggaac 
ctcaggactc 
gacctacacc 
tgagaggcca 
ggacgcaccc 
tcacccggag 
caccaggctc 
gcaggtgctt 
agccgacccc 
atccgagtaa 
tgcccaggta 
gtagcctgca 
ctcagcacca 



cagggagtgg 
gcccagacac 
gcctgggccc 
accaagggcc 
gccgccctgg 
tcaggcgctc 
tactccctca 
tgcaacgtag 
gctcagggag 
cggctgtgca 
gcctctgccc 
caggcaggca 
ggctcagacc 
aaaggccaaa 
ctcccaatct 
agccagccca 
tccagggaca 
cctgtggcag 



gctaaggtga 
tggaccctgc 
agctctgtcc 
catcggtctt 
gctgcctggt 
tgaccagcgg 
gcagcgtggt 
atcacaagcc 
ggagggtgtc 
gccccagccc 
gccccactca 
caggctgggt 
tgccaaaagc 
ctgtccactc 
tctctctgca 
ggcctcgccc 
ggccccagct 
gaccgtcagt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 
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PCT/US02/34197 



cttcctcttc 
gtgcgtggtg 
cggcgtggag 
ccgtgtggtc 
gtgcaaggtc 
aggtgggacc 
ccctgggagt 
tgtacaccct 
tggtcaaagg 
agaacaacta 
gcaagctcac 
tgcatgaggc 
gagtgccacg 
gcacgtaccc 
gccctgggcc 
tgagtggcat 



cccccaaaac 
gtggacgtga 
gtgcataatg 
agcgtcctca 
tccaacaaag 
cgcggggtat 
gaccgctgtg 
gcccccatcc 
cttctacccc 
caagaccaca 
cgtggacaag 
tctgcacaac 
gccggcaagc 
cgtgtacata 
cctgcgagac 
gagggaggca 



ccaaggacac 
gccacgaaga 
ccaagacaaa 
ccgttgtgca 
gcctcccagc 
gagggccaca 
ccaacctctg 
cgggaggaga 
agcgacatcg 
cctcccatgc 
agcaggtggc 
cactacacgc 
ccccgctccc 
cttcccaggc 
tgtgatggtt 
gagtgggtc 



45/119 

cctcatgatc 
ccccgaggtc 
gccacgggag 
ccaggactgg 
ccccatcgag 
tggacagagg 
tccctacagg 
tgaccaagaa 
ccgtggagtg 
tggactccga 
agcaggggaa 
agaagagcct 
caggctctcg 
acccagcatg 
ctttccgtgg 



tcccggaccc 
cagttcaact 
gagcagttca 
ctgaacggca 
aaaaccatct 
ccggctcggc 
gcagccccga 
ccaggtcagc 
ggagagcaat 
cggctccttc 
cgtcttctca 
ctccctgtct 
gggtcgcgtg 
gaaataaagc 
gtcaggccga 



ctgaggtcac 
ggtacgtgga 
acagcacgtt 
a ggagtacaa 
ccaaaaccaa 
ccaccctctg 
gaaccacagg 
ctgacctgcc 
gggcagccgg 
ttcctctaca 
tgctccgtga 
ccgggtaaat 
aggatgcttg 
acccagcgct 
gtctgaggcc 



1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2009 



<210> 56 

<211> 2590 

<212> DNA 

<213> Homo sapiens 



<400> 56 

agctttctgg 

gcaggtggcg 

tggaccctcg 

cacaccgcag 

ccccctggcg 

caaggactac 

cgtgcacacc 

gaccgtgccc 

cagcaacacc 

tgctggaagc 

agggcagcaa 

gctcagggag 

cccctacccc 

atatccagga 

ctcagctcag 

agctcaaaac 

cccaggactc 

gacaggcccc 

tcttctctct 

taagccagcc 

agtccaggga 

actcccaatc 

tgcccaggta 

gtggcctgag 

tccccgtaac 

gcccaaggtg 

gccctagagt 

atctcttcct 

cccaaggata 

agccacgaag 

gccaagacaa 

accgtcctgc 

gccctcccag 

tgagggccac 

gccaacctct 

ccgggaggag 

cagcgacatc 



ggcaggccag 
ccagccaggc 
tggatagaca 
tcacatggcg 
ccctgctcca 
ttccccgaac 
ttcccggctg 
tccagcagct 
aaggtggaca 
caggctcagc 
ggcaggcccc 
agggtcttct 
aggcccttca 
ggaccctgcc 
acaccttctc 
cccacttggt 
gccctccagc 
agcagggtgc 
gcagagccca 
caggcctcac 
caggccccag 
ttctctctgc 
agccagccca 
tccagggaca 
tcccaatctt 
cccaggtaag 
ggcctgcatc 
cagcacctga 
cccttatgat 
accccgaggt 
agccgcggga 
accaggactg 
cccccatcga 
atggacagag 
gtccctacag 
atgaccaaga 
gccgtggagt 



gcctgacttt 
gcacacccaa 
agaaccgagg 
ccatctctct 
ggagcacctc 
cggtgacggt 
tcctacagtc 
tgggcaccca 
agagagttgg 
cctcctgcct 
gtctgactcc 
ggctttttcc 
cacacagggg 
cctgacctaa 
tcttcccaga 
gacacaactc 
tcaaggcggg 
tgacgcatcc 
aatcttgtga 
cctccagctc 
cagggtgctg 
agagcccaaa 
ggcctcgccc 
ggccccagca 
ctctctgcag 
ccagcccagg 
cagggacagg 
actcctggga 
ttcccggacc 
ccagttcaag 
ggagcagtac 
gctgaacggc 
gaaaaccatc 
gccagcttga 
gacagccccg 
accaggtcag 
999 a 9 a 9cag 



ggctgggggc 
tgcccgtgag 
ggcctctgcg 
tgcagcttcc 
tgggggcaca 
gtcgtggaac 
ctcaggactc 
gacctacacc 
tgagaggcca 
ggacgcatcc 
tcacccggag 
accaggctcc 
caggtgctgc 
gcccacccca 
tctgagtaac 
acacatgccc 
acaagagccc 
acctccatcc 
cacacctccc 
aaggcaggac 
acgcgtccac 
tcttgtgaca 
tccagctcaa 
gggtgctgac 
agcccaaatc 
cctcgccctc 
tcccagtcgg 
ggaccgtcag 
cctgaggtca 
tggtacgtgg 
aacagcacgt 
aaggagtaca 
tccaaaacca 
cccaccctct 
agaaccacag 
cctgacctgc 
cgggcagccg 



a ggg a ggggg 

cccagacact 
ccctgggccc 
accaagggcc 
gcggccctgg 
tcaggcgccc 
tactccctca 
tgcaacgtga 
gcgcagggag 
cggctgtgca 
cctctgcccg 
gggcaggcac 
gctcagagct 
aaggccaaac 
tcccaatctt 
acggtgccca 
tagagtggcc 
cagatccccg 
ccgtgcccac 
aagagcccta 
ctccatccca 
cacctccccc 
ggcgggacaa 
gcatccacct 
ttgtgacaca 
cagctcaagg 
gtgctgacac 
tcttcctctt 
cgtgcgtggt 
acggcgtgga 
tccgtgtggt 
agtgcaaggt 
aaggtgggac 
gccctgggag 
gtgtacaccc 
ctggtcaaag 
gagaacaact 



ctaaggtgac 
ggaccctgcc 
agctctgtcc 
catcggtctt 
gctgcctggt 
tgaccagcgg 
gcagcgtggt 
atcacaagcc 
ggagggtgtc 
gtcccagccc 
ccccactcat 
aggctggatg 
gccaaaagcc 
tctctactca 
ctctctgcag 
ggtaagccag 
tgagtccagg 
taactcccaa 
ggtgcccagg 
gagtggcctg 
gatccccgta 
atgcccacgg 
gagccctaga 
ccatcccaga 
cctcccccgt 
caggacaggt 
atctgcctcc 
ccccccaaaa 
ggtggacgtg 
ggtgcataat 
cagcgtcctc 
ctccaacaaa 
ccgcggggta 
tgaccgctgt 
tgcccccatc 
gcttctaccc 
acaacaccac 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 
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gcctcccatg 
gagcaggtgg 
ccgcttcacg 
cccccgctcc 
acttcccggg 
ctgtgatggt 
agagcgggtc 



ctggactccg 
cagcagggga 
cagaagagcc 
ccgggctctc 
cacccagcat 
tctttccacg 



acggctcctt 
acatcttctc 
tctccctgtc 
ggggtcgcgc 
ggaaataaag 
ggtcaggccg 



46/119 

cttcctctac 
atgctccgtg 
tccgggtaaa 
gaggatgctt 
cacccagcgc 
agtctgaggc 



agcaagctca 
atgcatgagg 
tgagtgcgac 
ggcacgtacc 
tgccctgggc 
ctgagtgaca 
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ccgtggacaa 2280 
ctctgcacaa 2340 
agccggcaag 2400 
ccgtgtacat 2460 
ccctgtgaga 2520 
tgagggaggc 2 5 80 
2590 



<210> 57 

<211> 2028 

<212> DNA 

<213> Homo sapiens 



<400> 57 

agctttctgg 

gcaggtggcg 

tggaccatcg 

cacaccgcgg 

ccccctggcg 

caaggactac 

cgtgcacacc 

gaccgtgccc 

cagcaacacc 

tgctggaagc 

agggcagcaa 

tgctcaggga 

cccctacccc 

tatccgggag 

tcagctcaga 

gtccaaatat 

cagctcaagg 

gtgctgacgc 

tcttcctgtt 

cgtgcgtggt 

atggcgtgga 

accgtgtggt 

agtgcaaggt 

aaggtgggac 

gccctgggag 

gtgtacaccc 

ctggtcaaag 

gagaacaact 

agcaggctaa 

atgcatgagg 

tgagtgccag 

ggcacgtacc 

tgccctgggc 

ctgagtgaca 



ggcaggccgg 
ccagccaggt 
cggatagaca 
tcacatggca 
ccctgctcca 
ttccccgaac 
ttcccggctg 
tccagcagct 
aaggtggaca 
caggctcagc 
ggcatgcccc 
gagggtcttc 
aggccctgcg 
gaccctgccc 
caccttctct 
ggtcccccat 
cgggacaggt 
atccacctcc 
ccccccaaaa 
ggtggacgtg 
ggtgcataat 
cagcgtcctc 
ctccaacaaa 
ccacggggtg 
tgaccgctgt 
tgcccccatc 
gcttctaccc 
acaagaccac 
ccgtggacaa 
ctctgcacaa 
ggccggcaag 
ccgtctacat 
ccctgtgaga 
tgagggaggc 



gcctgacttt 
gcacacccaa 
agaaccgagg 
ccacctctct 
ggagcacctc 
cggtgacggt 
tcctacagtc 
tgggcacgaa 
agagagttgg 
cctcctgcct 
atctgtctcc 
tggatttttc 
catacagggc 
ctgacctaag 
cctcccagat 
gcccatcatg 
gccctagagt 
atctcttcct 
cccaaggaca 
agccaggaag 
gccaagacaa 
accgtcctgc 
ggcctcccgt 
cgagggccac 
gccaacctct 
ccaggaggag 
cagcgacatc 
gcctcccgtg 
gagcaggtgg 
ccactacaca 
cccccgctcc 
acttcccagg 
ctgtgatggt 
agagcgggtc 



ggctgggggc 
tgcccatgag 
ggcctctgcg 
tgcagcttcc 
cgagagcaca 
gtcgtggaac 
ctcaggactc 
gacctacacc 
tgagaggcca 
ggacgcaccc 
tcacccggag 
caccaggctc 
a 99tgctgcg 
cccaccccaa 
ctgagtaact 
cccaggtaag 
agcctgcatc 
cagcacctga 
ctctcatgat 
accccgaggt 
agccgcggga 
accaggactg 
cctccatcga 
acggacagag 
gtccctacag 
atgaccaaga 
gccgtggagt 
ctggactccg 
caggagggga 
cagaagagcc 
ccgggctctc 
cacccagcat 
tctttccacg 
ccactgtccc 



agggaggggg 
cccagacact 
ccctgggccc 
accaagggcc 
gccgccctgg 
tcaggcgccc 
tactccctca 
tgcaacgtag 
gcacagggag 
cggctgtgca 
gcctctgacc 
ccggcaccac 
ctcagacctg 
aggccaaact 
cccaatcttc 
ccaacccagg 
cagggacagg 
gttcctgggg 
ctcccggacc 
ccagttcaac 
ggagcagttc 
gctgaacggc 
gaaaaccatc 
gccagctcgg 
ggcagccccg 
accaggtcag 

999 a 9 a 9 caa 
acggctcctt 
atgtcttctc 
tctccctgtc 
ggggtcgcgc 
ggaaataaag 
ggtcaggccg 
cacactgg 



ctaaggtgac 
ggaccctgca 
agctctgtcc 
catccgtctt 
gctgcctggt 
tgaccagcgg 
gcagcgtggt 
atcacaagcc 
ggagggtgtc 
gccccagccc 
accccactca 
aggctggatg 
ccaagagcca 
ctccactccc 
tctctgcaga 
cctcgccctc 
ccccagccgg 
ggaccatcag 
cctgaggtca 
tggtacgtgg 
aacagcacgt 
aaggagtaca 
tccaaagcca 
cccaccctct 
agagccacag 
cctgacctgc 

tgggcagccg 

cttcctctac 
atgctccgtg 
tctgggtaaa 
gaggatgctt 
cacccaccac 
agtctgaggc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2028 



<210> 58 

<211> 106 

<212> DNA 

<213> Homo sapiens 

<400> 58 

tgccacccca ggactctgtc ttccagcacc caccaaggct ccggatgtgt tccccatcat 60 
atcagggtgc agacacccaa aggataacag ccctgtggtc ctggca 106 



WO 03/064992 



<210> 59 

<211> 1725 

<212> DNA 

<213> Homo sapiens 



47/119 



PCT/US02/34197 



<400> 59 

atggactgga 

cagttggtgc 

aaggcttctg 

cacgggcttg 

agatttcagg 

ctgagaagtc 

tggagtgatt 

acggtcaccg 

tgcaaaaaca 

ttcccggagc 

ttaccagcca 

tcgggtgcgt 

gactgggtcg 

aagatcttac 

tgcctcgtct 

gtcatggacg 

caaagcgagc 

gtcacctatc 

agaggggtga 

cccacgatca 

tggtcccggg 

aatggcacgt 

gagacctacc 

accaagacca 

ccggggagcc 

atctcggtgc 
cagccccgca 
gccgaatggg 
tcacagaccg 



cctggatcct 
agtctggggc 
gatacacctt 
agtgggtggg 
gcagggtcac 
tgagatctga 
attataactt 
tctcctcagc 
ttccctccaa 
cggtgatggt 
ccaccctcac 
gggccaagca 
acaacaaaac 
agtcgtcctg 
ctgggtacac 
tggacttgtc 
tcaccctcag 
aaggtcacac 
gcgcctacct 
cctgtctggt 
ccagtgggaa 
taaccgtcac 
agtgcagggt 
gcggcccgcg 
gggacaagcg 
agtggctgca 
agaccaaggg 
agcagaaaga 
tccagcgagc 



cttcttggtg 
tgaggtgagg 
catcgactcc 
atggatcaac 
catgaccaga 
cgactcggcc 
tgactactcg 
ctccacacag 
tgccacctcc 
gacctgggac 
gctctctggt 
gatgttcacc 
cttcagcgtc 
cgacggcggc 
cccagggact 
caccgcctct 
ccagaagcac 
ctttgaggac 
aagccggccc 
ggtggacctg 
gcctgtgaac 
gtccaccctg 
gacccacccc 
tgctgccccg 
caccctcgcc 
caacgaggtg 
ctccggcttc 
tgagttcatc 
ggtgtctgta 



gcagcagcca 
aagcctgggg 
tatatccact 
cctaacagtg 
gacgcgtcct 
gtgttttact 
tacactttgg 
agcccatccg 
gtgactctgg 
acaggctccc 
cactatgcca 
tgccgtgtgg 
tgctccaggg 
gggcacttcc 
atcaacatca 
accacgcagg 
tggctgtcag 
agcaccaaga 
agcccgttcg 
gcacccagca 
cactccacca 
ccggtgggca 
cacctgccca 
gaagtctatg 
tgcctgatcc 
cagctcccgg 
ttcgtcttca 
tgccgtgcag 
aatcccggta 



cgcgagtcca 
catcagtgag 
ggatacgaca 
gtggcacaaa 
tcagtacagc 
gtgcgaaaag 
acgtctgggg 
tcttcccctt 
gctgcctggc 
tcaacgggac 
ccatcagctt 
cacacactcc 
acttcacccc 
ccccgaccat 
cctggctgga 

agggtgagct 

accgcaccta 
agtgtgcaga 
acctgttcat 
aggggaccgt 
gaaaggagga 
cccgagactg 
gggccctcat 
cgtttgcgac 
agaacttcat 
acgcccggca 
gccgcctgga 
tccatgaggc 
aatga 



ctcccagacg 
ggtctcctgc 
ggcccctggg 
ctatgctccg 
ctacatggac 
tgaccctttt 
ccaagggacc 
gacccgctgc 
cacgggctac 
aactatgacc 
gctgaccgtc 
atcgtccaca 
gcccaccgtg 
ccagctcctg 
ggacgggcag 
ggcctccaca 
cacctgccag 
ttccaacccg 
ccgcaagtcg 
gaacctgacc 
gaagcagcgc 
gatcgagggg 
gcggtccacg 
gccggagtgg 
gcctgaggac 
cagcacgacg 
ggtgaccagg 
agcgagcccc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1725 



<210> 60 

<211> 428 

<212> PRT 

<213> Homo sapiens 

<400> 60 

Ala Ser Thr Gin Ser Pro Ser Val Phe Pro Leu Thr Arg Cys Cys Lys 
15 10 15 

Asn lie Pro Ser Asn Ala Thr Ser Val Thr Leu Gly Cys Leu Ala Thr 
20 25 30 

Gly Tyr Phe Pro Glu Pro Val Met Val Thr Trp Asp Thr Gly Ser Leu 
35 40 45 

Asn Gly Thr Thr Met Thr Leu Pro Ala Thr Thr Leu Thr Leu Ser Gly 
50 55 60 

His Tyr Ala Thr lie Ser Leu Leu Thr Val Ser Gly Ala Trp Ala Lys 
65 70 75 80 



Gin Met Phe Thr Cys Arg Val Ala His Thr Pro Ser Ser Thr Asp Trp 
85 90 95 
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Val Asp Asn Lys Thr 
100 

Thr Val Lys lie Leu 
115 

Pro Thr lie Gin Leu 
130 

lie Asn lie Thr Trp 
145 

Ser Thr Ala Ser Thr 
165 

Glu Leu Thr Leu Ser 
180 

Cys Gin Val Thr Tyr 
195 

Cys Ala Asp Ser Asn 
210 

Ser Pro Phe Asp Leu 
225 

Val Val Asp Leu Ala 
245 

Arg Ala Ser Gly Lys 
260 

Gin Arg Asn Gly Thr 
275 

Arg Asp Trp lie Glu 
290 

His Leu Pro Arg Ala 
305 

Arg Ala Ala Pro Glu 
325 

Ser Arg Asp Lys Arg 
340 

Glu Asp lie Ser Val 
355 

Ala Arg His Ser Thr 
370 

Phe Val Phe Ser Arg 
385 

Asp Glu Phe lie Cys 
405 

Thr Val Gin Arg Ala 



48/119 

Phe Ser Val Cys Ser 
105 

Gin Ser Ser Cys Asp 
120 

Leu Cys Leu Val Ser 
135 

Leu Glu Asp Gly Gin 
150 

Thr Gin Glu Gly Glu 
170 

Gin Lys His Trp Leu 
185 

Gin Gly His Thr Phe 
200 

Pro Arg Gly Val Ser 
215 

Phe lie Arg Lys Ser 
230 

Pro Ser Lys Gly Thr 
250 

Pro Val Asn His Ser 
265 

Leu Thr Val Thr Ser 
280 

Gly Glu Thr Tyr Gin 
295 

Leu Met Arg Ser Thr 
310 

Val Tyr Ala Phe Ala 
330 

Thr Leu Ala Cys Leu 
345 

Gin Trp Leu His Asn 
360 

Thr Gin Pro Arg Lys 
375 

Leu Glu Val Thr Arg 
390 

Arg Ala Val His Glu 
410 

Val Ser Val Asn Pro 



Arg Asp Phe Thr Pro 
110 

Gly Gly Gly His Phe 
125 

Gly Tyr Thr Pro Gly 
140 

Val Met Asp Val Asp 
155 

Leu Ala Ser Thr Gin 
175 

Ser Asp Arg Thr Tyr 
190 

Glu Asp Ser Thr Lys 
205 

Ala Tyr Leu Ser Arg 
220 

Pro Thr lie Thr Cys 
235 

Val Asn Leu Thr Trp 
255 

Thr Arg Lys Glu Glu 
270 

Thr Leu Pro Val Gly 
285 

Cys Arg Val Thr His 
300 

Thr Lys Thr Ser Gly 
315 

Thr Pro Glu Trp Pro 
335 

lie Gin Asn Phe Met 
350 

Glu Val Gin Leu Pro 
365 

Thr Lys Gly Ser Gly 
380 

Ala Glu Trp Glu Gin 
395 

Ala Ala Ser Pro Ser 
415 

Gly Lys 
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Pro 

Pro 

Thr 

Leu 
160 

Ser 

Thr 

Lys 

Pro 

Leu 
240 

Ser 

Lys 

Thr 

Pro 

Pro 
320 

Gly 

Pro 

Asp 

Phe 

Lys 
400 

Gin 
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420 



49/119 
425 
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<210> 61 

<211> 1884 

<212> DNA 

<213> Homo sapiens 



<400> 61 

atggactgga 

gtgcagctgg 

tgcaaggctt 

ggacaagggc 

cagaagttcc 

gagctgagca 

ctggggccgt 

gtctggggcc 

ttccccctcg 

ctcgcacagg 

gacatcagca 

tcacaggtgc 

aaagtccagc 

ctgcctccca 

agcaagtcca 

tggctgcgcg 

gccaaagagt 

gactggctca 

cagaatgcgt 

cccccatcct 

gacctgacca 

aaaacccaca 

gccagcatct 

acagacctgc 

aggcccgatg 

accatcacgt 

agggggcagc 

gccccaggcc 

ggggagacct 

accgtggaca 

tgggccaccg 

gtcaccttgt 



cctggaggtt 
tgcagtctgg 
ctggaggcac 
ttgagtggat 
agggcagagt 
gcctgagatc 
atagcagtgg 
aagggaccac 
tctcctgtga 
acttccttcc 
gcacccgggg 
tgctgccttc 
accccaacgg 
aagtgagcgt 
agctcatctg 
aggggaagca 
ctgggcccac 
gccagagcat 
cctccatgtg 
ttgccagcat 
cctatgacag 
ccaacatctc 
gcgaggatga 
cctcgccact 
tctacttgct 
gcctggtgac 
ccttgtcccc 
ggtacttcgc 
acacctgcgt 
agtccaccga 
cctccacctt 
tcaaggtgaa 



cctctttgtg 
ggctgaggtg 
cttcagcagc 
gggagggatc 

cacgattacc 
tgaggacacg 
ctggtacccg 
ggtcaccgtc 
gaattccccg 
cgactccatc 
cttcccatca 
caaggacgtc 
caacaaagaa 
cttcgtccca 
ccaggccacg 
ggtggggtct 
gacctacaag 
gttcacctgc 
tgtccccgat 
cttcctcacc 
cgtgaccatc 
cgagagccac 
ctggaattcc 
gaagcagacc 
gccaccagcc 
gggcttctct 
ggagaagtat 
ccacagcatc 
ggtggcccat 
gggggaggtg 
catcgtcctc 
atga 



gtggcagcag 
aagaagcctg 
tatgctatca 
atccctatct 
gcggacgaat 
gccgtgtatt 
aactcggact 
tcctcaggga 
tcggatacga 
actttctcct 
gtcctgagag 
atgcagggca 
aagaacgtgc 
ccccgcgacg 
ggtttcagtc 
ggcgtcacca 
gtgaccagca 
cgcgtggatc 
caagacacag 
aagtccacca 
tcctggaccc 
cccaatgcca 
ggggagaggt 
atctcccggc 
cgggagcagc 
cccgcggacg 
gtgaccagcg 
ctgaccgtgt 
gaggccctgc 
agcgccgacg 
ttcctcctga 



ctacaggtgt 
ggtcctcggt 
gctgggtgcg 
ttggtacagc 
ccacgagcac 
actgtgcgaa 
actactacta 
gtgcatccgc 
gcagcgtggc 
ggaaatacaa 
ggggcaagta 
cagacgaaca 
ctcttccagt 
gcttcttcgg 
cccggcagat 
cggaccaggt 
cactgaccat 
acaggggcct 
ccatccgggt 
agttgacctg 
gccagaatgg 
ctttcagcgc 
tcacgtgcac 
ccaagggggt 
tgaacctgcg 
tcttcgtgca 
ccccaatgcc 
ccgaagagga 
ccaacagggt 
aggagggctt 
gcctcttcta 



ccagtcccag 
gaaggtctcc 
acaggcccct 
aaactacgca 
agcctacatg 
aaccgggatc 
cggtatggac 
cccaaccctt 
cgttggctgc 
gaacaactct 
cgcagccacc 
cgtggtgtgc 
gattgctgag 
caacccccgc 
tcaggtgtcc 
gcaggctgag 
caaagagagc 
gaccttccag 
cttcgccatc 
cctggtcaca 
cgaagctgtg 
cgtgggtgag 
cgtgacccac 
ggccctgcac 
ggagtcggcc 
gtggatgcag 
tgagccccag 
atggaacacg 
caccgagagg 
tgagaacctg 
cagtaccacc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1884 



<210> 62 

<211> 454 

<212> PRT 

<213> Homo sapiens 

<400> 62 

Gly Ser Ala Ser Ala Pro Thr Leu Phe Pro Leu Val Ser Cys Glu Asn 
15 10 15 

Ser Pro Ser Asp Thr Ser Ser Val Ala Val Gly Cys Leu Ala Gin Asp 
20 25 30 

Phe Leu Pro Asp Ser lie Thr Phe Ser Trp Lys Tyr Lys Asn Asn Ser 
35 40 45 

Asp lie Ser Ser Thr Arg Gly Phe Pro Ser Val Leu Arg Gly Gly Lys 
50 55 60 
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Tyr Ala Ala Thr 
65 

Gly Thr Asp Glu 



Lys Glu Lys Asn 
100 

Val Ser Val Phe 
115 

Ser Lys Ser Lys 
130 

lie Gin Val Ser 
145 

Thr Thr Asp Gin 



Tyr Lys Val Thr 
180 

Gin Ser Met Phe 
195 

Gin Asn Ala Ser 
210 

Val Phe Ala He 
225 

Thr Lys Leu Thr 



Thr He Ser Trp 
260 

Asn He Ser Glu 
275 

Ala Ser He Cys 
290 

Thr Val Thr His 
305 

Arg Pro Lys Gly 



Pro Ala Arg Glu 
340 

Leu Val Thr Gly 
355 

Arg Gly Gin Pro 
370 



Ser Gin Val Leu 
70 

His Val Val Cys 
85 

Val Pro Leu Pro 



Val Pro Pro Arg 
120 

Leu He Cys Gin 
135 

Trp Leu Arg Glu 
150 

Val Gin Ala Glu 
165 

Ser Thr Leu Thr 



Thr Cys Arg Val 
200 

Ser Met Cys Val 
215 

Pro Pro Ser Phe 
230 

Cys Leu Val Thr 
245 

Thr Arg Gin Asn 



Ser His Pro Asn 
280 

Glu Asp Asp Trp 
295 

Thr Asp Leu Pro 
310 

Val Ala Leu His 
325 

Gin Leu Asn Leu 



Phe Ser Pro Ala 
360 

Leu Ser Pro Glu 
375 
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Leu Pro Ser Lys 
75 

Lys Val Gin His 
90 

Val He Ala Glu 
105 

Asp Gly Phe Phe 



Ala Thr Gly Phe 
140 

Gly Lys Gin Val 
155 

Ala Lys Glu Ser 
170 

He Lys Glu Ser 
185 

Asp His Arg Gly 



Pro Asp Gin Asp 
220 

Ala Ser He Phe 
235 

Asp Leu Thr Thr 
250 

Gly Glu Ala Val 
265 

Ala Thr Phe Ser 



Asn Ser Gly Glu 
300 

Ser Pro Leu Lys 
315 

Arg Pro Asp Val 
330 

Arg Glu Ser Ala 
345 

Asp Val Phe Val 



Lys Tyr Val Thr 
380 



Asp Val Met Gin 
80 

Pro Asn Gly Asn 
95 

Leu Pro Pro Lys 
110 

Gly Asn Pro Arg 
125 

Ser Pro Arg Gin 



Gly Ser Gly Val 
160 

Gly Pro Thr Thr 
175 

Asp Trp Leu Ser 
190 

Leu Thr Phe Gin 
205 

Thr Ala He Arg 



Leu Thr Lys Ser 
240 

Tyr Asp Ser Val 
255 

Lys Thr His Thr 
270 

Ala Val Gly Glu 
285 

Arg Phe Thr Cys 



Gin Thr He Ser 
320 

Tyr Leu Leu Pro 
335 

Thr He Thr Cys 
350 

Gin Trp Met Gin 
365 

Ser Ala Pro Met 



Pro Glu Pro Gin Ala Pro Gly Arg Tyr Phe Ala His Ser He Leu Thr 
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395 
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400 



Val Ser Glu Glu 



Ala His Glu Ala 
420 

Ser Thr Gly Lys 
435 

Thr Ala Gly Thr 
450 



Glu Trp Asn Thr 
405 

Leu Pro Asn Arg 

Pro Thr Leu Tyr 
440 

Cys Tyr 



Gly Glu Thr Tyr 
410 

Val Thr Glu Arg 
425 

Asn Val Ser Leu 



Thr Cys Val Val 
415 

Thr Val Asp Lys 
430 

Val Met Ser Asp 
445 



<210> 63 

<211> 532 

<212> PRT 

<213> Homo sapiens 

<400> 63 * 

Met Ala Pro Ser Ser Pro Arg Pro Ala Leu Pro Ala Leu Leu Val Leu 
15 10 15 

Leu Gly Ala Leu Phe Pro Gly Pro Gly Asn Ala Gin Thr Ser Val Ser 
20 25 30 

Pro Ser Lys Val lie Leu Pro Arg Gly Gly Ser Val Leu Val Thr Cys 
35 40 45 

Ser Thr Ser Cys Asp Gin Pro Lys Leu Leu Gly lie Glu Thr Pro Leu 
50 55 60 

Pro Lys Lys Glu Leu Leu Leu Pro Gly Asn Asn Arg Lys Val Tyr Glu 
65 70 75 80 

Leu Ser Asn Val Gin Glu Asp Ser Gin Pro Met Cys Tyr Ser Asn Cys 
85 90 95 

Pro Asp Gly Gin Ser Thr Ala Lys Thr Phe Leu Thr Val Tyr Trp Thr 
100 105 110 

Pro Glu Arg Val Glu Leu Ala Pro Leu Pro Ser Trp Gin Pro Val Gly 
115 120 125 

Lys Asn Leu Thr Leu Arg Cys Gin Val Glu Gly Gly Ala Pro Arg Ala 
130 135 140 

Asn Leu Thr Val Val Leu Leu Arg Gly Glu Lys Glu Leu Lys Arg Glu 
145 150 155 160 

Pro Ala Val Gly Glu Pro Ala Glu Val Thr Thr Thr Val Leu Val Arg 
165 170 175 

Arg Asp His His Gly Ala Asn Phe Ser Cys Arg Thr Glu Leu Asp Leu 
180 185 190 

Arg Pro Gin Gly Leu Glu Leu Phe Glu Asn Thr Ser Ala Pro Tyr Gin 
195 200 205 

Leu Gin Thr Phe Val Leu Pro Ala Thr Pro Pro Gin Leu Val Ser Pro 
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210 215 220 

Arg Val Leu Glu Val Asp Thr Gin Gly Thr Val Val Cys Ser Leu Asp 
225 230 235 240 

Gly Leu Phe Pro Val Ser Glu Ala Gin Val His Leu Ala Leu Gly Asp 
245 250 255 

Gin Arg Leu Asn Pro Thr Val Thr Tyr Gly Asn Asp Ser Phe Ser Ala 
260 265 270 

Lys Ala Ser Val Ser Val Thr Ala Glu Asp Glu Gly Thr Gin Arg Leu 
275 280 285 

Thr Cys Ala Val lie Leu Gly Asn Gin Ser Gin Glu Thr Leu Gin Thr 
290 295 300 

Val Thr lie Tyr Ser Phe Pro Ala Pro Asn Val lie Leu Thr Lys Pro 
305 310 315 320 

Glu Val Ser Glu Gly Thr Glu Val Thr Val Lys Cys Glu Ala His Pro 
325 330 335 

Arg Ala Lys Val Thr Leu Asn Gly Val Pro Ala Gin Pro Leu Gly Pro 
340 345 350 

Arg Ala Gin Leu Leu Leu Lys Ala Thr Pro Glu Asp Asn Gly Arg Ser 
355 360 365 

Phe Ser Cys Ser Ala Thr Leu Glu Val Ala Gly Gin Leu lie His Lys 
370 375 380 

Asn Gin Thr Arg Glu Leu Arg Val Leu Tyr Gly Pro Arg Leu Asp Glu 
385 390 395 400 

Arg Asp Cys Pro Gly Asn Trp Thr Trp Pro Glu Asn Ser Gin Gin Thr 
405 410 415 

Pro Met Cys Gin Ala Trp Gly Asn Pro Leu Pro Glu Leu Lys Cys Leu 
420 425 430 

Lys Asp Gly Thr Phe Pro Leu Pro lie Gly Glu Ser Val Thr Val Thr 
435 440 445 

Arg Asp Leu Glu Gly Thr Tyr Leu Cys Arg Ala Arg Ser Thr Gin Gly 
450 455 460 

Glu Val Thr Arg Glu Val Thr Val Asn Val Leu Ser Pro Arg Tyr Glu 
465 470 475 480 

He Val He He Thr Val Val Ala Ala Ala Val He Met Gly Thr Ala 
485 490 495 

Gly Leu Ser Thr Tyr Leu Tyr Asn Arg Gin Arg Lys He Lys Lys Tyr 
500 505 510 

Arg Leu Gin Gin Ala Gin Lys Gly Thr Pro Met Lys Pro Asn Thr Gin 
515 520 525 



Ala Thr Pro Pro 
530 
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<210> 64 
<211> 275 
<212> PRT 

<213> Homo sapiens 
<400> 64 

Met Ser Ser Phe Gly Tyr Arg Thr Leu Thr Val Ala Leu Phe Thr Leu 
15 10 15 

lie Cys Cys Pro Gly Ser Asp Glu Lys Val Phe Glu Val His Val Arg 
20 25 30 

Pro Lys Lys Leu Ala Val Glu Pro Lys Gly Ser Leu Glu Val Asn Cys 
35 40 45 

Ser Thr Thr Cys Asn Gin Pro Glu Val Gly Gly Leu Glu Thr Ser Leu 
50 55 60 

Asp Lys lie Leu Leu Asp Glu Gin Ala Gin Trp Lys His Tyr Leu Val 
65 70 75 80 

Ser Asn lie Ser His Asp Thr Val Leu Gin Cys His Phe Thr Cys Ser 
85 90 95 

Gly Lys Gin Glu Ser Met Asn Ser Asn Val Ser Val Tyr Gin Pro Pro 
100 105 110 

Arg Gin Val lie Leu Thr Leu Gin Pro Thr Leu Val Ala Val Gly Lys 
115 120 125 

Ser Phe Thr lie Glu Cys Arg Val Pro Thr Val Glu Pro Leu Asp Ser 
130 135 140 

Leu Thr Leu Phe Leu Phe Arg Gly Asn Glu Thr Leu His Tyr Glu Thr 
145 150 155 160 

Phe Gly Lys Ala Ala Pro Ala Pro Gin Glu Ala Thr Ala Thr Phe Asn 
165 170 175 

Ser Thr Ala Asp Arg Glu Asp Gly His Arg Asn Phe Ser Cys Leu Ala 
180 185 190 

Val Leu Asp Leu Met Ser Arg Gly Gly Asn lie Phe His Lys His Ser 
195 200 205 

Ala Pro Lys Met Leu Glu lie Tyr Glu Pro Val Ser Asp Ser Gin Met 
210 215 220 

Val lie He Val Thr Val Val Ser Val Leu Leu Ser Leu Phe Val Thr 
225 230 235 240 

Ser Val Leu Leu Cys Phe He Phe Gly Gin His Leu Arg Gin Gin Arg 
245 250 255 

Met Gly Thr Tyr Gly Val Arg Ala Ala Trp Arg Arg Leu Pro Gin Ala 
260 265 270 



Phe Arg Pro 
275 
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<210> 65 
<211> 547 
<212> PRT 

<213> Homo sapiens 
<400> 65 

Met Ala Thr Met Val Pro Ser Val Leu Trp Pro Arg Ala Cys Trp Thr 
15 10 15 

Leu Leu Val Cys Cys Leu Leu Thr Pro Gly Val Gin Gly Gin Glu Phe 
20 25 30 

Leu Leu Arg Val Glu Pro Gin Asn Pro Val Leu Ser Ala Gly Gly Ser 
35 40 45 

Leu Phe Val Asn Cys Ser Thr Asp Cys Pro Ser Ser Glu Lys lie Ala 
50 55 60 

Leu Glu Thr Ser Leu Ser Lys Glu Leu Val Ala Ser Gly Met Gly Trp 
65 70 75 80 

Ala Ala Phe Asn Leu Ser Asn Val Thr Gly Asn Ser Arg lie Leu Cys 
85 90 95 

Ser Val Tyr Cys Asn Gly Ser Gin lie Thr Gly Ser Ser Asn lie Thr 
100 105 110 

Val Tyr Gly Leu Pro Glu Arg Val Glu Leu Ala Pro Leu Pro Pro Trp 
115 120 125 

Gin Pro Val Gly Gin Asn Phe Thr Leu Arg Cys Gin Val Glu Gly Gly 
130 135 140 

Ser Pro Arg Thr Ser Leu Thr Val Val Leu Leu Arg Trp Glu Glu Glu 
145 150 155 160 

Leu Ser Arg Gin Pro Ala Val Glu Glu Pro Ala Glu Val Thr Ala Thr 
165 170 175 

Val Leu Ala Ser Arg Asp Asp His Gly Ala Pro Phe Ser Cys Arg Thr 
180 185 190 

Glu Leu Asp Met Gin Pro Gin Gly Leu Gly Leu Phe Val Asn Thr Ser 
195 200 205 

Ala Pro Arg Gin Leu Arg Thr Phe Val Leu Pro Val Thr Pro Pro Arg 
210 215 220 

Leu Val Ala Pro Arg Phe Leu Glu Val Glu Thr Ser Trp Pro Val Asp 
225 230 235 240 

Cys Thr Leu Asp Gly Leu Phe Pro Ala Ser Glu Ala Gin Val Tyr Leu 
245 250 255 

Ala Leu Gly Asp Gin Met Leu Asn Ala Thr Val Met Asn His Gly Asp 
260 265 270 

Thr Leu Thr Ala Thr Ala Thr Ala Thr Ala Arg Ala Asp Gin Glu Gly 
275 280 285 
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Ala Arg Glu lie Val Cys Asn Val Thr Leu Gly Gly Glu Arg Arg Glu 
290 295 300 

Ala Arg Glu Asn Leu Thr Val Phe Ser Phe Leu Gly Pro lie Val Asn 
305 310 315 320 

Leu Ser Glu Pro Thr Ala His Glu Gly Ser Thr Val Thr Val Ser Cys 
325 330 335 

Met Ala Gly Ala Arg Val Gin Val Thr Leu Asp Gly Val Pro Ala Ala 
340 345 350 

Ala Pro Gly Gin Pro Ala Gin Leu Gin Leu Asn Ala Thr Glu Ser Asp 
355 360 365 

Asp Gly Arg Ser Phe Phe Cys Ser Ala Thr Leu Glu Val Asp Gly Glu 
370 375 380 

Phe Leu His Arg Asn Ser Ser Val Gin Leu Arg Val Leu Tyr Gly Pro 
385 390 395 400 

Lys lie Asp Arg Ala Thr Cys Pro Gin His Leu Lys Trp Lys Asp Lys 
405 410 415 

Thr Arg His Val Leu Gin Cys Gin Ala Arg Gly Asn Pro Tyr Pro Glu 
420 425 430 

Leu Arg Cys Leu Lys Glu Gly Ser Ser Arg Glu Val Pro Val Gly lie 
435 440 445 

Pro Phe Phe Val Asn Val Thr His Asn Gly Thr Tyr Gin Cys Gin Ala 
450 455 460 

Ser Ser Ser Arg Gly Lys Tyr Thr Leu Val Val Val Met Asp lie Glu 
465 470 475 480 

Ala Gly Ser Ser His Phe Val Pro Val Phe Val Ala Val Leu Leu Thr 
485 490 495 

Leu Gly Val Val Thr lie Val Leu Ala Leu Met Tyr Val Phe Arg Glu 
500 505 510 

His Gin Arg Ser Gly Ser Tyr His Val Arg Glu Glu Ser Thr Tyr Leu 
515 520 525 

Pro Leu Thr Ser Met Gin Pro Thr Glu Ala Met Gly Glu Glu Pro Ser 
530 535 540 

Arg Ala Glu 
545 



<210> 66 
<211> 577 
<212> PRT 

<213> Homo sapiens 



<400> 66 

Gly Val Pro Glu Glu Leu Phe Glu Val Ser lie Trp Pro Ser Gin Ala 
1 5 10 15 
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Leu Val Glu Phe 
20 

Pro Asp Pro Gly 
35 

Val Gly Lys Gly 
50 

Glu Asn Ser lie 
65 

Asp Thr Ser Leu 



Leu Glu Leu Gin 
100 

Lys Cys His Val 
115 

Leu Leu Gin Gly 
130 

Ala Val Ala Ser 
145 

Lys Glu Asn Asp 



Ser Leu Gin Gly 
180 

Val Arg lie Phe 
195 

Leu Leu Glu Ala 
210 

Val Phe Pro Ala 
225 

Glu Leu Ser Ser 



Ala Thr lie Arg 
260 

Ala Ser Leu Gly 
275 

Tyr Asn Lys Trp 
290 

Lys His Arg Val 
305 

Leu Leu Gly Met 



Gly Gin Ser Leu 



Pro Ser Gly lie 
40 

Pro Gin Trp Lys 
55 

Leu Gin Cys Phe 
70 

Gly lie Thr Val 
85 

Pro Ala Trp Val 



Pro Ser Val Ala 
120 

Asn Gin Glu Leu 
135 

Gin Arg Ala Glu 
150 

Arg Cys Asn Ser 
165 

Gly Arg Leu Phe 

Glu Phe Ser Gin 
200 

Gly Met Ala Glu 
215 

Lys Glu Val Met 
230 

Phe Leu Ser Trp 
245 

Thr Met Glu Ala 



Ala Met Glu Gin 
280 

Pro Gly Ser Ser 
2 95 

Thr Gly Trp Phe 
310 

Leu Ser Ser Glu 
325 



Val Val Asn Cys 
25 

Glu Thr Phe Leu 



Glu Phe Leu Leu 
60 

Phe Ser Cys Ala 
75 

Tyr Gin Pro Pro 
90 

Ala Val Asp Glu 
105 

Pro. Leu Glu Ser 



His Arg Lys Asn 
140 

Val lie lie Ser 
155 

Ser Cys His Ala 
170 

Gin Gly Ser Ser 
185 

Ser Pro His lie 



Thr Val Ser Cys 
220 

Phe His Met Phe 
235 

Glu Gly Asp Thr 
250 

Gly Asp Gin Glu 
265 

Lys Thr Arg Lys 



Phe Phe lie Arg 
300 

Gly Cys Arg His 
315 

His Glu Ser Ser 
330 



Ser Thr Thr Cys 
30 

Lys Lys Thr Gin 
45 

Glu Asp Val Thr 



Gly lie Gin Lys 
80 

Glu Gin Val lie 
95 

Ala Phe Thr Val 
110 

Leu Thr Leu Ala 
125 

Phe Thr Ser Leu 



Val Arg Ala Gin 
160 

Glu Leu Asp Leu 
175 

Pro lie Arg lie 
190 

Trp Val Ser Ser 
205 

Glu Val Ala Arg 



Leu Glu Asp Gin 
240 

Ala Trp Ala Asn 
255 

Leu Ser Cys Phe 
270 

Leu Val His Ser 
285 

Val Leu Cys Cys 



Pro Cys Cys Pro 
320 

Ser Phe Ser Gly 
335 
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Phe Pro Pro Pro lie Leu Glu Leu Lys Glu Ser Tyr Pro Leu Ala Gly 
340 345 350 

Thr Asp lie Asn Val Thr Cys Ser Gly His Val Leu Thr Ser Pro Ser 
355 360 365 

Pro Thr Leu Arg Leu Gin Gly Ala Pro Asp Leu Pro Ala Gly Glu Pro 
370 375 380 

Ala Trp Leu Leu Leu Thr Ala Arg Glu Glu Asp Asp Gly Asn Phe Ser 
385 390 395 400 

Cys Glu Ala Ser Leu Val Val Gin Gly Gin Arg Leu Met Lys Thr Thr 
405 410 415 

Val lie Gin Leu His lie Leu Cys Lys Pro Gin Leu Glu Glu Ser Ser 
420 425 430 

Cys Pro Gly Lys Gin Thr Trp Leu Glu Gly Met Glu His Thr Leu Ala 
435 440 445 

Cys Val Pro Lys Gly Asn Pro Ala Pro Ala Leu Val Cys Thr Trp Asn 
450 455 460 

Gly Val Val Phe Asp Leu Glu Val Pro Gin Lys Ala Thr Asn His Thr 
465 470 475 480 

Gly Thr Tyr Arg Tyr Thr Ala Thr Asn Gin Leu Gly Ser Val Ser Lys 
485 490 495 

Asp lie Ala Val lie Val Gin Gly Leu Asp Glu Gly lie Ser Ser Thr 
500 505 510 

Leu Phe Val lie lie Thr Val Ala Leu Gly Val Gly Val He Thr He 
515 520 525 

Ala Leu Tyr Leu Ser Tyr Arg Pro Cys Lys Val Asp Arg Arg Lys Leu 
530 535 540 

Leu Tyr Arg Gin Lys Glu Glu Asp Lys Glu Glu Glu Ser Gin Phe Ala 
545 550 555 560 

Val Gin Glu Glu Lys Ser Thr Thr His He lie Asp Ser Tyr Leu He 
565 570 575 

Glu 



<210> 67 

<211> 924 

<212> PRT 

<213> Homo sapiens 

<400> 67 

Met Pro Gly Pro Ser Pro Gly Leu Arg Arg Ala Leu Leu Gly Leu Trp 
15 10 15 

Ala Ala Leu Gly Leu Gly Leu Phe Gly Leu Ser Ala Val Ser Gin Glu 
20 25 30 



Pro Phe Trp Ala Asp Leu Gin Pro Arg Val Ala Phe Val Glu Arg Gly 
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35 40 45 

Gly Ser Leu Trp Leu Asn Cys Ser Thr Asn Cys Pro Arg Pro Glu Arg 
50 55 60 

Gly Gly Leu Glu Thr Ser Leu Arg Arg Asn Gly Thr Gin Arg Gly Leu 
65 70 75 80 

Arg Trp Leu Ala Arg Gin Leu Val Asp lie Arg Glu Pro Glu Thr Gin 
85 90 95 

Pro Val Cys Phe Phe Arg Cys Ala Arg Arg Thr Leu Gin Ala Arg Gly 
100 105 110 

Leu lie Arg Thr Phe Gin Arg Pro Asp Arg Val Glu Leu Met Pro Leu 
115 120 125 

Pro Pro Trp Gin Pro Val Gly Glu Asn Phe Thr Leu Ser Cys Arg Val 
130 135 140 

Pro Gly Ala Gly Pro Arg Ala Ser Leu Thr Leu Thr Leu Leu Arg Gly 
145 150 155 160 

Ala Gin Glu Leu lie Arg Arg Ser Phe Ala Gly Glu Pro Pro Arg Ala 
165 170 175 

Arg Gly Ala Val Leu Thr Ala Thr Val Leu Ala Arg Arg Glu Asp His 
180 185 190 

Gly Ala Asn Phe Ser Cys Arg Ala Glu Leu Asp Leu Arg Pro His Gly 
195 200 205 

Leu Gly Leu Phe Glu Asn Ser Ser Ala Pro Arg Glu Leu Arg Thr Phe 
210 215 220 

Ser Leu Ser Pro Asp Ala Pro Arg Leu Ala Ala Pro Arg Leu Leu Glu 
225 230 235 240 

Val Gly Ser Glu Arg Pro Val Ser Cys Thr Leu Asp Gly Leu Phe Pro 
245 250 255 

Ala Ser Glu Ala Arg Val Tyr Leu Ala Leu Gly Asp Gin Asn Leu Ser 
260 265 270 

Pro Asp Val Thr Leu Glu Gly Asp Ala Phe Val Ala Thr Ala Thr Ala 
275 280 285 

Thr Ala Ser Ala Glu Gin Glu Gly Ala Arg Gin Leu lie Cys Asn Val 
290 295 300 

Thr Leu Gly Gly Glu Asn Arg Glu Thr Arg Glu Asn Val Thr lie Tyr 
305 310 315 320 

Ser Phe Pro Ala Pro Leu Leu Thr Leu Ser Glu Pro Ser Val Ser Glu 
325 330 335 

Gly Gin Met Val Thr Val Thr Cys Ala Ala Gly Thr Gin Ala Leu Val 
340 345 350 



Thr Leu Glu Gly Val Pro Ala Ala Val Pro Gly Gin Pro Ala Gin Leu 
3 55 3 60 3 65 
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Gin Leu Asn Ala 
370 

Ala Thr Leu Asp 
385 

Glu Leu Arg Val 



Arg Ser Trp Thr 
420 

Ala Arg Gly Asn 
435 

Gly Ala Val Leu 
450 

Ser Gly Thr Tyr 
465 

Lys Asp Val Thr 



Gly Cys Pro Glu 
500 

Ser Cys Val Ala 
515 

Ser Gly Glu Leu 
530 

Glu His Ala Gly 
545 

Ala Ala Lys Asn 



Glu Pro Ser Cys 
580 

Leu Phe Ser Cys 
595 

Val Gly Ser Gly 
610 

Pro Asp Pro Ser 
625 

Gly lie Tyr Val 



Thr Val Val Val 
660 

Cys Pro Ser His 
675 



Thr Glu Asn Asp 
375 

Val Asp Gly Glu 
390 

Leu Tyr Ala Pro 
405 

Trp Pro Glu Gly 



Pro Glu Pro Ser 
440 

Ala Leu Gly Leu 
455 

Arg Cys Lys Ala 
470 

Leu Thr Val Glu 
485 

Arg lie Thr Trp 



His Gly Val Pro 
520 

Gly Ala Val lie 
535 

Thr Tyr Arg Cys 
550 

Val Ala Val Thr 
565 

Pro Ser Asn Trp 



Glu Val Asp Gly 
600 

Gly Ala Thr Glu 
615 

Pro Arg Ala Pro 
630 

Cys Asn Ala Thr 
645 

Ser Ala Glu Ser 



Gin Thr Trp Leu 
680 



Asp Arg Arg Ser 
380 

Thr Leu lie Lys 
395 

Arg Leu Asp Asp 
410 

Pro Glu Gin Thr 
425 

Val His Cys Ala 



Leu Gly Pro Val 
460 

Ala Asn Asp Gin 
475 

Tyr Ala Pro Ala 
490 

Leu Glu Gly Thr 
505 

Pro Pro Asp Val 



Glu Gly Leu Leu 
540 

Glu Ala Thr Asn 
555 

Val Glu Tyr Gly 
570 

Thr Trp Val Glu 
585 

Lys Pro Gin Pro 



Gly Val Leu Leu 
620 

Arg lie Pro Arg 
635 

Asn Arg His Gly 
650 

Pro Pro Glu Met 
665 

Glu Gly Ala Glu 



Phe Phe Cys Asp 



Asn Arg Ser Ala 
400 

Ser Asp Cys Pro 
415 

Leu Arg Cys Glu 
430 

Arg Ser Asp Gly 
445 

Thr Arg Ala Leu 



Gly Glu Ala Val 
480 

Leu Asp Ser Val 
495 

Glu Ala Ser Leu 
510 

lie Cys Val Arg 
525 

Arg Val Ala Arg 



Pro Arg Gly Ser 
560 

Pro Arg Phe Glu 
575 

Gly Ser Gly Arg 
590 

Ser Val Lys Cys 
605 

Pro Leu Ala Pro 



Val Leu Ala Pro 
640 

Ser Val Ala Lys 
655 

Asp Glu Ser Thr 
670 

Ala Ser Ala Leu 
685 
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Ala Cys Ala Ala Arg Gly Arg Pro Ser Pro Gly Val Arg Cys Ser Arg 
690 695 700 

Glu Gly lie Pro Trp Pro Glu Gin Gin Arg Val Ser Arg Glu Asp Ala 
705 710 715 720 

Gly Thr Tyr His Cys Val Ala Thr Asn Ala His Gly Thr Asp Ser Arg 
725 730 735 

Thr Val Thr Val Gly Val Glu Tyr Arg Pro Val Val Ala Glu Leu Ala 
740 745 750 

Ala Ser Pro Pro Gly Gly Val Arg Pro Gly Gly Asn Phe Thr Leu Thr 
755 760 765 

Cys Arg Ala Glu Ala Trp Pro Pro Ala Gin lie Ser Trp Arg Ala Pro 
770 775 780 

Pro Gly Ala Leu Asn lie Gly Leu Ser Ser Asn Asn Ser Thr Leu Ser 
785 790 795 800 

Val Ala Gly Ala Met Gly Ser His Gly Gly Glu Tyr Glu Cys Ala Arg 
805 810 815 

Thr Asn Ala His Gly Arg His Ala Arg Arg lie Thr Val Arg Val Ala 
820 825 830 

Gly Pro Trp Leu Trp Val Ala Val Gly Gly Ala Ala Gly Gly Ala Ala 
835 840 845 

Leu Leu Ala Ala Gly Ala Gly Leu Ala Phe Tyr Val Gin Ser Thr Ala 
850 855 860 

Cys Lys Lys Gly Glu Tyr Asn Val Gin Glu Ala Glu Ser Ser Gly Glu 
865 870 875 880 

Ala Val Cys Leu Asn Gly Ala Gly Gly Gly Ala Gly Gly Ala Ala Gly 
885 890 895 

Ala Glu Gly Gly Pro Glu Ala Ala Gly Gly Ala Ala Glu Ser Pro Ala 
900 905 910 

Glu Gly Glu Val Phe Ala He Gin Leu Thr Ser Ala 
915 920 



<210> 68 
<211> 406 
<212> PRT 

<213> Homo sapiens 
<400> 68 

Met Asp Phe Gly Leu Ala Leu Leu Leu Ala Gly Leu Leu Gly Leu Leu 
15 10 15 

Leu Gly Gin Ser Leu Gin Val Lys Pro Leu Gin Val Glu Pro Pro Glu 
20 25 30 



Pro Val Val Ala Val Ala Leu Gly Ala Ser Arg Gin Leu Thr Cys Arg 
35 40 45 
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Leu Ala Cys Ala Asp Arg Gly Ala Ser Val Gin Trp Arg Gly Leu Asp 
50 55 60 

Thr Ser Leu Gly Ala Val Gin Ser Asp Thr Gly Arg Ser Val Leu Thr 
65 70 75 80 

Val Arg Asn Ala Ser Leu Ser Ala Ala Gly Thr Arg Val Cys Val Gly 
85 90 95 

Ser Cys Gly Gly Arg Thr Phe Gin His Thr Val Gin Leu Leu Val Tyr 
100 105 110 

Ala Phe Pro Asp Gin Leu Thr Val Ser Pro Ala Ala Leu Val Pro Gly 
115 120 125 

Asp Pro Glu Val Ala Cys Thr Ala His Lys Val Thr Pro Val Asp Pro 
130 135 140 

Asn Ala Leu Ser Phe Ser Leu Leu Val Gly Gly Gin Glu Leu Glu Gly 
145 150 155 160 

Ala Gin Ala Leu Gly Pro Glu Val Gin Glu Glu Glu Glu Glu Pro Gin 
165 170 175 

Gly Asp Glu Asp Val Leu Phe Arg Val Thr Glu Arg Trp Arg Leu Pro 
180 185 190 

Pro Leu Gly Thr Pro Val Pro Pro Ala Leu Tyr Cys Gin Ala Thr Met 
195 200 205 

Arg Leu Pro Gly Leu Glu Leu Ser His Arg Gin Ala lie Pro Val Leu 
210 215 220 

His Ser Pro Thr Ser Pro Glu Pro Pro Asp Thr Thr Ser Pro Glu Pro 
225 230 235 240 

Pro Asn Thr Thr Ser Pro Glu Ser Pro Asp Thr Thr Ser Pro Glu Ser 
245 250 255 

Pro Asp Thr Thr Ser Gin Glu Pro Pro Asp Thr Thr Ser Gin Glu Pro 
260 265 270 

Pro Asp Thr Thr Ser Gin Glu Pro Pro Asp Thr Thr Ser Pro Glu Pro 
275 280 285 

Pro Asp Lys Thr Ser Pro Glu Pro Ala Pro Gin Gin Gly Ser Thr His 
290 295 300 

Thr Pro Arg Ser Pro Gly Ser Thr Arg Thr Arg Arg Pro Glu lie Ser 
305 310 315 320 

Gin Ala Gly Pro Thr Gin Gly Glu Val lie Pro Thr Gly Ser Ser Lys 
325 330 335 

Pro Ala Gly Asp Gin Leu Pro Ala Ala Leu Trp Thr Ser Ser Ala Val 
340 345 350 

Leu Gly Leu Leu Leu Leu Ala Leu Pro Thr Tyr His Leu Trp Lys Arg 
355 360 365 



Cys Arg His Leu Ala Glu Asp Asp Thr His Pro Pro Ala Ser Leu Arg 
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375 380 
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Leu Leu Pro Gin Val Ser Ala Trp Ala Gly Leu Arg Gly Thr Gly Gin 
385 390 395 400 

Val Gly lie Ser Pro Ser 
405 



<210> 69 
<211> 739 
<212> PRT 

<213> Homo sapiens 
<400> 69 

Met Pro Gly Lys Met Val Val lie Leu Gly Ala Ser Asn lie Leu Trp 
15 10 15 

lie Met Phe Ala Ala Ser Gin Ala Phe Lys lie Glu Thr Thr Pro Glu 
20 25 30 

Ser Arg Tyr Leu Ala Gin lie Gly Asp Ser Val Ser Leu Thr Cys Ser 
35 40 45 

Thr Thr Gly Cys Glu Ser Pro Phe Phe Ser Trp Arg Thr Gin lie Asp 
50 55 60 

Ser Pro Leu Asn Gly Lys Val Thr Asn Glu Gly Thr Thr Ser Thr Leu 
65 70 75 80 

Thr Met Asn Pro Val Ser Phe Gly Asn Glu His Ser Tyr Leu Cys Thr 
85 90 95 

Ala Thr Cys Glu Ser Arg Lys Leu Glu Lys Gly lie Gin Val Glu lie 
100 105 110 

Tyr Ser Phe Pro Lys Asp Pro Glu lie His Leu Ser Gly Pro Leu Glu 
115 120 125 

Ala Gly Lys Pro lie Thr Val Lys Cys Ser Val Ala Asp Val Tyr Pro 
130 135 140 

Phe Asp Arg Leu Glu lie Asp Leu Leu Lys Gly Asp His Leu Met Lys 
145 150 155 160 

Ser Gin Glu Phe Leu Glu Asp Ala Asp Arg Lys Ser Leu Glu Thr Lys 
165 170 175 

Ser Leu Glu Val Thr Phe Thr Pro Val He Glu Asp He Gly Lys Val 
180 185 190 

Leu Val Cys Arg Ala Lys Leu His He Asp Glu Met Asp Ser Val Pro 
195 200 205 

Thr Val Arg Gin Ala Val Lys Glu Leu Gin Val Tyr He Ser Pro Lys 
210 215 220 

Asn Thr Val He Ser Val Asn Pro Ser Thr Lys Leu Gin Glu Gly Gly 
225 230 235 240 



Ser Val Thr Met Thr Cys Ser Ser Glu Gly Leu Pro Ala Pro Glu He 
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255 



Phe Trp Ser Lys 
260 

Asn Ala Thr Leu 
275 

Tyr Val Cys Glu 
290 

Glu Leu lie Val 
305 

Pro Arg lie Ala 



Val Met Gly Cys 
340 

Ser Pro Leu Ser 
355 

Thr Leu Ser Pro 
370 

Val Thr Cys Gly 
385 

Tyr Ser Phe Pro 



Asn Gly Ser Ser 
420 

Leu Asp Arg Leu 
435 

Asn lie Glu Phe 
450 

Ser Leu Glu Met 
465 

Leu Val Cys Gin 



Lys Gin Arg Gin 
500 

Asp Thr Thr Val 
515 

Ser Val Asn Met 
530 

Leu Trp Ser Arg 
545 

Asn Ala Thr Leu 



Lys Leu Asp Asn 



Thr Leu lie Ala 
280 

Gly Val Asn Leu 
295 

Gin Glu Lys Pro 
310 

Ala Gin lie Gly 
325 

Glu Ser Pro Ser 



Gly Lys Val Arg 
360 

Val Ser Phe Glu 
375 

His Lys Lys Leu 
390 

Arg Asp Pro Glu 
405 

Val Thr Val Ser 



Glu lie Glu Leu 
440 

Leu Glu Asp Thr 
455 

Thr Phe lie Pro 
470 

Ala Lys Leu His 
485 

Ser Thr Gin Thr 



Leu Val Ser Pro 
520 

Thr Cys Leu Ser 
535 

Gin Leu Pro Asn 
550 

Thr Leu He Ser 
565 



Gly Asn Leu Gin 
265 

Met Arg Met Glu 



He Gly Lys Asn 
300 

Phe Thr Val Glu 
315 

Asp Ser Val Met 
330 

Phe Ser Trp Arg 
345 

Ser Glu Gly Thr 



Asn Glu His Ser 
380 

Glu Lys Gly He 
395 

He Glu Met Ser 
410 

Cys Lys Val Pro 
425 

Leu Lys Gly Glu 



Asp Met Lys Ser 
460 

Thr He Glu Asp 
475 

He Asp Asp Met 
490 

Leu Tyr Val Asn 
505 

Ser Ser He Leu 



Gin Gly Phe Pro 
540 

Gly Glu Leu Gin 
555 

Thr Lys Met Glu 
570 



His Leu Ser Gly 
270 

Asp Ser Gly He 
2 85 

Arg Lys Glu Val 



He Ser Pro Gly 
320 

Leu Thr Cys Ser 
335 

Thr Gin He Asp 
350 

Asn Ser Thr Leu 
365 

Tyr Leu Cys Thr 



Gin Val Glu Leu 
400 

Gly Gly Leu Val 
415 

Ser Val Tyr Pro 
430 

Thr He Leu Glu 
445 

Leu Glu Asn Lys 



Thr Gly Lys Ala 
480 

Glu Phe Glu Pro 
495 

Val Ala Pro Arg 
510 

Glu Glu Gly Ser 
525 

Ala Pro Lys He 



Pro Leu Ser Glu 
560 

Asp Ser Gly Val 
575 
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Tyr Leu Cys Glu Gly lie Asn Gin Ala Gly Arg Ser Arg Lys Glu Val 
580 585 590 

Glu Leu He He Gin Val Thr Pro Lys Asp He Lys Leu Thr Ala Phe 
595 600 605 

Pro Ser Glu Ser Val Lys Glu Gly Asp Thr Val He He Ser Cys Thr 
610 615 620 

Cys Gly Asn Val Pro Glu Thr Trp He He Leu Lys Lys Lys Ala Glu 
625 630 635 640 

Thr Gly Asp Thr Val Leu Lys Ser He Asp Gly Ala Tyr Thr He Arg 
645 650 655 

Lys Ala Gin Leu Lys Asp Ala Gly Val Tyr Glu Cys Glu Ser Lys Asn 
660 665 670 

Lys Val Gly Ser Gin Leu Arg Ser Leu Thr Leu Asp Val Gin Gly Arg 
675 680 685 

Glu Asn Asn Lys Asp Tyr Phe Ser Pro Glu Leu Leu Val Leu Tyr Phe 
690 695 700 

Ala Ser Ser Leu He He Pro Ala He Gly Met He He Tyr Phe Ala 
705 710 715 720 

Arg Lys Ala Asn Met Lys Gly Ser Tyr Ser Leu Val Glu Ala Gin Lys 
725 730 735 

Ser Lys Val 



<210> 70 

<211> 537 

<212> PRT 

<213> Mus musculus 

<400> 70 

Met Ala Ser Thr Arg Ala Lys Pro Thr Leu Pro Leu Leu Leu Ala Leu 
15 10 15 

Val Thr Val Val He Pro Gly Pro Gly Asp Ala Gin Val Ser He His 
20 25 30 

Pro Arg Glu Ala Phe Leu Pro Gin Gly Gly Ser Val Gin Val Asn Cys 
35 40 45 

Ser Ser Ser Cys Lys Glu Asp Leu Ser Leu Gly Leu Glu Thr Gin Trp 
50 55 60 

Leu Lys Asp Glu Leu Glu Ser Gly Pro Asn Trp Lys Leu Phe Glu Leu 
65 70 75 80 

Ser Glu He Gly Glu Asp Ser Ser Pro Leu Cys Phe Glu Asn Cys Gly 
85 90 95 



Thr Val Gin Ser Ser Ala Ser Ala Thr He Thr Val Tyr Ser Phe Pro 
100 105 110 
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Glu Ser val Glu Leu 
115 

Asp Leu Thr Leu Arg 
130 

Leu Ser Ala Val Leu 
145 

Val Gly Gly His Pro 
165 

Ala Ser Arg Gly Asp 
180 

Asp Leu Arg Pro Gin 
195 

Arg Ser Leu Arg Thr 
210 

Thr Pro Asp Leu Leu 
225 

Leu Glu Gly Leu Phe 
245 

Gly Gly Gin Met Pro 
260 

Ser Ala Thr Ala Leu 
275 

Pro Leu Arg Cys Val 
290 

Arg Thr Leu Thr Val 
305 

Gin Leu Glu Val Ser 
325 

His Ser Gly Ser Lys 
340 

Pro Thr Pro Gin Val 
355 

Lys Arg Ser Phe Phe 
370 

Leu Phe Lys Asn Gin 
385 

Leu Asp Glu Thr Asp 
405 

Gin Gin Thr Leu Lys 
420 

Thr Cys Arg Arg Lys 



65/119 

Arg Pro Leu Pro Ala 
120 

Cys His Val Asp Gly 
13 5 

Leu Arg Gly Glu Glu 
150 

Lys Asp Pro Lys Glu 
170 

His Gly Ala Asn Phe 
185 

Gly Leu Ala Leu Phe 
200 

Phe Asp Leu Pro Ala 
215 

Glu Val Gly Thr Gin 
230 

Pro Ala Ser Glu Ala 
250 

Thr Gin Glu Ser Thr 
265 

Val Glu Val Thr Glu 
280 

Leu Glu Leu Ala Asp 
295 

Tyr Asn Phe Ser Ala 
310 

Glu Gly Ser Gin Val 
330 

Val Val Leu Leu Ser 
345 

Gin Phe Thr Leu Asn 
360 

Cys Ser Ala Ala Leu 
375 

Thr Leu Glu Leu His 
390 

Cys Leu Gly Asn Trp 
410 

Cys Gin Ala Trp Gly 
425 

Ala Asp Gly Ala Leu 



Trp Gin Gin Val Gly 
12 5 

Gly Ala Pro Arg Thr 
14 0 

lie Leu Ser Arg Gin 
155 

lie Thr Phe Thr Val 
175 

Ser Cys Arg Thr Glu 
190 

Ser Asn Val Ser Glu 
205 

Thr lie Pro Lys Leu 
220 

Gin Lys Leu Phe Cys 
235 

Arg lie Tyr Leu Glu 
255 

Asn Ser Ser Asp Ser 
270 

Glu Phe Asp Arg Thr 
285 

Gin lie Leu Glu Thr 
300 

Pro Val Leu Thr Leu 
315 

Thr Val Lys Cys Glu 
335 

Gly Val Glu Pro Arg 
350 

Ala Ser Ser Glu Asp 
365 

Glu Val Ala Gly Lys 
380 

Val Leu Tyr Gly Pro 
395 

Thr Trp Gin Glu Gly 
415 

Asn Pro Ser Pro Lys 
430 

Leu Pro lie Gly Val 



PCT/US02/34197 
Lys 

Gin 

Pro 
160 

Leu 

Leu 

Ala 

Asp 

Ser 
240 

Leu 

val 

Leu 

Gin 

Ser 
320 

Ala 

Pro 

His 

Phe 

Arg 
400 

Ser 
Met 
val 
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Lys Ser Val Lys 
450 

Ser Ser His Gly 
465 

His Ser Gin Asn 



Val lie Val Gly 
500 

Arg Lys lie Arg 
515 

Lys Leu Lys Gly 
530 



Gin Glu Met Asn 
455 

Asn Val Thr Arg 
470 

Asn Trp Thr lie 
485 

Leu Val Met Ala 



lie Tyr Lys Leu 
520 

Gin Ala Pro Pro 
535 



Gly Thr Tyr Val 
460 

Asn Val Tyr Leu 
475 

lie lie Leu Val 
490 

Ala Ser Tyr Val 
505 

Gin Lys Ala Gin 



Pro 



Cys His Ala Phe 



Thr Val Leu Tyr 
480 

Pro Val Leu Leu 
495 

Tyr Asn Arg Gin 
510 

Glu Glu Ala lie 
525 



<210> 71 

<211> 537 

<212> PRT 

<213> Mus musculus 

<400> 71 

Met Ala Ser Thr Arg Ala Lys Pro Thr Leu Pro Leu Leu Leu Ala Leu 
15 10 15 

Val Thr Val Val lie Pro Gly Pro Gly Asp Ala Gin Val Ser lie His 
20 25 30 

Pro Arg Glu Ala Phe Leu Pro Gin Gly Gly Ser Val Gin Val Asn Cys 
35 40 45 

Ser Ser Ser Cys Lys Glu Asp Leu Ser Leu Gly Leu Glu Thr Gin Trp 
50 55 60 

Leu Lys Asp Glu Leu Glu Ser Gly Pro Asn Trp Lys Leu Phe Glu Leu 
65 70 75 80 

Ser Glu lie Gly Glu Asp Ser Ser Pro Leu Cys Phe Glu Asn Cys Gly 
85 90 95 

Thr Val Gin Ser Ser Ala Ser Ala Thr lie Thr Val Tyr Ser Phe Pro 
100 105 110 

Glu Ser Val Glu Leu Arg Pro Leu Pro Ala Trp Gin Gin Val Gly Lys 
115 120 125 

Asp Leu Thr Leu Arg Cys His Val Asp Gly Gly Ala Pro Arg Thr Gin 
130 135 140 

Leu Ser Ala Val Leu Leu Arg Gly Glu Glu lie Leu Ser Arg Gin Pro 
145 150 155 160 

Val Gly Gly His Pro Lys Asp Pro Lys Glu lie Thr Phe Thr Val Leu 
165 170 175 



Ala Ser Arg Gly Asp His Gly Ala Asn Phe Ser Cys Arg Thr Glu Leu 
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190 



Asp Leu Arg Pro 
195 

Arg Ser Leu Arg 
210 

Thr Pro Asp Leu 
225 

Leu Glu Ala Leu 



Gly Gly Gin Met 
260 

Ser Ala Thr Ala 
275 

Pro Leu Arg Cys 
290 

Arg Thr Leu Thr 
305 

Gin Leu Glu Val 



His Ser Gly Ser 
340 

Pro Thr Pro Gin 
355 

Lys Arg Ser Phe 
370 

Leu Phe Lys Asn 
3 85 

Leu Asp Glu Thr 



Gin Gin Thr Leu 
420 

Thr Cys Arg Arg 
435 

Lys Ser Val Lys 
450 

Ser Ser His Gly 
465 

His Ser Gin Asn 



Val lie Val Gly 
500 



Gin Gly Leu Ala 
200 

Thr Phe Asp Leu 
215 

Leu Glu Val Gly 
230 

Phe Pro Ala Ser 
245 

Pro Thr Gin Glu 



Leu Val Glu Val 
280 

Val Leu Glu Leu 
295 

Val Tyr Asn Phe 
310 

Ser Glu Gly Ser 
325 

Lys Val Val Leu 



Val Gin Phe Thr 
360 

Phe Cys Ser Ala 
375 

Gin Thr Leu Glu 
390 

Asp Cys Leu Gly 
405 

Lys Cys Gin Ala 



Lys Ala Asp Gly 
440 

Gin Glu Met Asn 
455 

Asn Val Thr Arg 
470 

Asn Trp Thr lie 
485 

Leu Val Met Ala 



Leu Phe Ser Asn 



Pro Ala Thr lie 
220 

Thr Gin Gin Lys 
235 

Glu Ala Arg lie 
250 

Ser Thr Asn Ser 
265 

Thr Glu Glu Phe 



Ala Asp Gin lie 
300 

Ser Ala Pro Val 
315 

Gin Val Thr Val 
330 

Leu Ser Gly Val 
345 

Leu Asn Ala Ser 



Ala Leu Glu Val 
380 

Leu His Val Leu 
395 

Asn Trp Thr Trp 
410 

Trp Gly Asn Pro 
425 

Ala Leu Leu Pro 



Gly Thr Tyr Val 
460 

Asn Val Tyr Leu 
475 

He He Leu Val 
490 

Ala Ser Tyr Val 
505 



Val Ser Glu Ala 
205 

Pro Lys Leu Asp 



Leu Phe Cys Ser 
240 

Tyr Leu Glu Leu 
255 

Ser Asp Ser Val 
270 

Asp Arg Thr Leu 
285 

Leu Glu Thr Gin 



Leu Thr Leu Ser 
320 

Lys Cys Glu Ala 
335 

Glu Pro Arg Pro 
350 

Ser Glu Asp His 
365 

Ala Gly Lys Phe 



Tyr Gly Pro Arg 
400 

Gin Glu Gly Ser 
415 

Ser Pro Lys Met 
430 

He Gly Val Val 
445 

Cys His Ala Phe 



Thr Val Leu Tyr 
480 

Pro Val Leu Leu 
495 

Tyr Asn Arg Gin 
510 
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Arg Lys lie Arg lie Tyr Lys Leu Gin Lys Ala Gin Glu Glu Ala lie 
515 520 525 

Lys Leu Lys Gly Gin Ala Pro Pro Pro 
530 535 



<210> 72 
<211> 527 
<212> PRT 

<213> Cricetulus griseus 
<400> 72 

Met Ala Pro Thr Arg Ala Arg Pro Thr Pro Pro Leu Leu Leu Ala Leu 
15 10 15 

Val Ala Val Val lie Pro Gly Pro Gly Ser Ala Gin Val Ser lie His 
20 25 30 

Pro Lys Glu Ala Phe Leu Pro Arg Gly Ala Ser Met Gin Val Asn Cys 
35 40 45 

Ser Ser Ser Cys Ser Glu Asn Leu Ser Leu Gly Leu Glu Thr Gin Trp 
50 55 60 

Pro Lys Val Glu Leu Asp His Gly His Asn Trp Lys Leu Phe Glu Leu 
65 70 75 80 

Ser Asp lie Gly Asp Asp Ser Lys Pro Leu Cys Phe Glu Asn Cys Gly 
85 90 95 

Pro lie Gin Ser Ser Ala Ser Ala Thr lie Val Leu Tyr Ser Phe Pro 
100 105 110 

Glu Arg Val Glu Leu Asp Arg Leu Pro Thr Trp Gin Pro Val Gly Lys 
115 120 125 

Asn Leu Thr Leu Arg Cys Leu Val Asp Gly Gly Thr Pro Arg Ser Gin 
130 135 140 

Leu Ser Val Lys Leu Leu Arg Gly Gly Glu Val Leu His Gin Glu Pro 
145 150 155 160 

Val Gly Val Asp Ser Arg Asn Pro Lys Glu Val Thr Val Thr Val Leu 
165 170 175 

Ala Ser Arg Asp Asp His Gly Ala Asn Phe Ser Cys Arg Thr Glu Leu 
180 185 190 

Asp Leu Arg Pro Gin Gly Leu Ala Leu Phe Pro Asn Val Ser Val lie 
195 200 205 

Arg Gin Leu Trp Thr Phe Asp Leu Pro Val Thr Glu Pro Lys Leu Asp 
210 215 220 

Thr Pro Asp Leu Leu Glu Val Gly Thr Val Gin Lys Val Met Cys Ser 
225 230 235 240 

Leu Gly Gly Leu Phe Pro Ala Ala Glu Ala Arg lie Thr Leu Glu Leu 
245 250 255 
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Gly Gly His Thr Leu Thr Ser Lys Ser Thr Asn His Arg Asp Leu Val 
260 265 270 

Ser Ala Thr Ala Leu Val Thr Ala Glu Met Glu Gly Thr Gin Gin Leu 
275 280 285 

Arg Cys Val Leu Glu Leu Ala Asp Gin lie Leu Lys Ala Glu Arg Thr 
290 295 300 

Leu Ser lie Tyr Asn Phe Ser Ala Pro Val Leu Thr Leu Ser Gin Gin 
305 310 315 320 

Glu Val Ser Glu Gly Ser Gin Val Thr Val Lys Cys Glu Ala Gin Gly 
325 330 335 

Gly Ala Gin Val Arg Leu Ser Gly Ala Pro Pro Gly Gin Val Gin Phe 
340 345 350 

Thr Leu Asn Ala Ser Ser Glu Asp His Glu Arg lie Phe Thr Cys Ser 
355 360 365 

Ala Ala Leu Arg Val Ala Gly Gin Glu Leu Leu Lys Asn Gin Thr Leu 
370 375 380 

Lys Leu His Val Leu Tyr Gly Pro Arg Leu Asp Glu Asn Asp Cys Pro 
385 390 395 400 

Gly Asn Trp Thr Trp Pro Glu Gly Ser Gin Gin Asn Leu Ser Cys Gin 
405 410 415 

Ala Phe Gly Asn Pro Pro Pro Lys Leu Thr Cys Ser Arg Lys Thr Asp 
420 425 430 

Gly Ala Leu Leu Pro lie Gly Glu Val Lys Thr Val Thr Trp Ala Met 
435 440 445 

Asn Gly Thr Tyr Val Cys His Ala Val Ser Ser His Gly Asn lie Thr 
450 455 460 

Arg Glu Val Phe Leu Lys Val Leu Pro Lys Ser Pro He Trp Pro He 
465 470 475 480 

He He He Val Val He Leu Ala Thr Val Val Phe Val Gly Val Leu 
485 490 495 

Thr He Tyr He Tyr Asn Arg Gin Arg Lys He Arg He Tyr Lys Leu 
500 505 510 

Gin Arg Ala Gin Glu Glu Ala Met Lys Leu Lys Val Pro Pro His 
515 520 525 



<210> 73 
<211> 544 
<212> PRT 
<213> Bos taurus 



<400> 73 

Met He Ala Ser Gly Pro Pro Pro Arg Val Tyr Trp Thr Ser Leu He 
15 10 15 
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Phe Leu Leu Leu Ala Cys Cys Leu Leu Pro Thr Gly Ala Gin Gly Gin 
20 25 30 

Thr Tyr Gin Val Arg Val Glu Pro Lys Asp Pro Val Val Pro Phe Gly 
35 40 45 

Glu Pro Leu Val Val Asn Cys Thr Leu Asp Cys Pro Gly Pro Gly Leu 
50 55 60 

lie Ser Leu Glu Thr Ala Leu Ser Lys Glu Pro His Ser Arg Gly Leu 
65 70 75 80 

Gly Trp Ala Ala Phe Arg Leu Thr Asn Val Thr Gly Asp Met Glu lie 
85 90 95 

Leu Cys Ser Gly lie Cys Asn Lys Ser Gin Val Val Gly Phe Ser Asn 
100 105 110 

lie Thr Val Phe Gly Phe Pro Lys Arg Val Glu Leu Ala Pro Leu Pro 
115 120 125 

Leu Trp Gin Pro Val Gly Glu Glu Leu Asn Leu Ser Cys Leu Val Ser 
130 135 140 

Gly Gly Ala Pro Arg Ala His Leu Ser Val Val Leu Leu Arg Gly Glu 
145 150 155 160 

Glu Glu Leu Gly Arg Gin Pro Leu Gly Lys Glu Glu Pro Ala Lys Val 
165 170 175 

Thr Phe Met Val Gin Pro Arg Arg Glu Asp His Gly Thr Asn Phe Ser 
180 185 190 

Cys Arg Ser Glu Leu Asp Leu Arg Ser Gin Gly Leu Glu Leu Phe Gin 
195 200 205 

Asn Thr Ser Ala Pro Arg Lys Leu Gin Thr Tyr Ala Met Pro Lys Thr 
210 215 220 

Ala Pro Arg Leu Val Phe Pro Arg Phe Trp Glu Met Glu Thr Ser Trp 
225 230 235 240 

Pro Val Asn Cys Ser Leu Asn Gly Leu Phe Pro Ala Ser Glu Ala His 
245 250 255 

lie Gin Leu Ala Leu Gly Asn Gin Met Leu Asn Ala Thr Val Val Ser 
260 265 270 

His Ala Asp Thr Leu Thr Ala Thr Ala Thr Ala Lys Thr Glu Gin Glu 
275 280 285 

Gly Thr Gin Glu lie Val Cys Asn Val Thr Leu Gly Val Glu Asn Arg 
290 295 300 

Glu Thr Arg Glu Ser Leu Val Ala Tyr Arg Phe Gin Gly Pro Asn Leu 
305 310 315 320 



Asn Leu Ser Glu Ser Asn Ala Thr Glu Gly Thr Pro Val Thr Val Thr 
325 330 335 
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Cys Ala Ala Gly Pro Gin Val Gin Val Met Leu Asp Gly Val Pro Ala 
340 345 350 

Ala Val Pro Gly Gin Pro Ala Gin Leu Gin Leu Lys Ala Thr Glu Met 
355 360 365 

Asp Asp Arg Arg Thr Phe Phe Cys Asn Ala Thr Leu Lys Val His Gly 
370 375 380 

Val Thr Leu His Arg Asn Arg Ser lie Gin Leu Arg Val Leu Tyr Gly 
385 390 395 400 

Pro Thr lie Asp Arg Ala Lys Cys Pro Gin Arg Leu Met Trp Lys Glu 
405 410 415 

Lys Thr Met His lie Leu Gin Cys Gin Ala Arg Gly Asn Pro Asn Pro 
420 425 430 

Gin Leu Gin Cys Leu Arg Glu Gly Ser Lys Phe Lys Val Pro Val Gly 
435 440 445 

lie Pro Phe Leu Val Leu Leu Asn Tyr Ser Gly Thr Tyr Ser Cys Gin 
450 455 460 

Ala Ala Ser Ser Arg Gly Thr Asp Lys Met Leu Val Met Met Asp Val 
465 470 475 480 

Gin Gly Arg Asn Pro Val Thr lie Asn lie Val Leu Gly Val Leu Ala 
485 490 495 

lie Leu Gly Leu Val Thr Leu Ala Ala Ala Ser Val Tyr Val Phe Trp 
500 505 510 

Val Gin Arg Gin His Asp lie Tyr His Leu Thr Pro Arg Ser Thr Arg 
515 520 525 

Trp Arg Leu Thr Ser Thr Gin Pro Val Thr Val Ala Glu Glu Leu Ser 
530 535 540 



<210> 74 
<211> 537 
<212> PRT 
<213> Sus scrofa 

<400> 74 

Met Ala Pro Gly Ala Thr His Pro Gly Gin Leu Ala Leu Leu Ala Leu 
15 10 15 

Leu Leu Pro Leu Leu Gly Ala Leu Leu Pro Gly Leu Gly Gly Ala Glu 
20 25 30 

lie Ser Met Trp Pro Leu Asn Thr lie lie Pro Lys Gly Gly Ser Met 
35 40 45 

Lys Val Asn Cys Ser Val Ala Cys Asp Gly Asn lie Thr Ser Phe Gly 
50 55 60 



Leu Glu Thr His Trp His Lys Thr Glu Val Asp His Arg Asp Lys Trp 
65 70 75 80 
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Lys lie Phe Glu Leu Ser Asn Val Glu Asn Asp Gly Thr Leu Leu Cys 
85 90 95 

His Ala Val Cys Gin Gly Asn Gin Thr Gin Val Gin Gly Asn Leu Thr 
100 105 110 

Val Tyr Trp Phe Pro Glu Tyr Val Lys Leu Ala Asn Leu Ser Trp Gin 
115 120 125 

Arg Glu Gly Gin His Phe Asn Leu Ser Cys Gin Val Ser Gly Gly Ala 
130 135 140 

Pro Arg Thr Asn Leu Ser Ala Val Leu Phe Arg Gly Glu Glu Glu Leu 
145 150 155 160 

Phe Arg Gin Ser Val Gly Met Glu Glu Pro Ala Asn Val Thr Phe Arg 
165 170 175 

Met Leu Ala Ser Arg Lys Asp His Gly Ala Asn Phe Ser Cys Arg Thr 
180 185 190 

Glu Leu Asn Leu Gin Pro Gin Gly Leu Glu Leu Phe Trp Asn Ser Ser 
195 200 205 

Ala Pro Leu Lys Leu Gin Thr Tyr Val Leu Pro Ala Thr His Pro His 
210 215 220 

Leu Ala Thr Pro Glu Leu Val Glu Val Gly Thr Pro Val Ser Val Asn 
225 230 235 240 

Cys Ser Leu Asp Gly Leu Phe Pro Ala Ser Glu Ala Thr Val His Leu 
245 250 255 

Ala Arg Gly Asp His Arg Pro Pro Leu Thr lie Thr His Asn Gly Asp 
260 265 270 

Ser Leu Leu Ala Lys Thr Trp lie Asn Gly Thr Glu Lys Glu Gin Gly 
275 280 285 

Thr Gin Tyr Leu Val Cys Glu lie Met Leu Ala Asp Glu Lys Val Val 
290 295 300 

Thr Lys Lys Asn Val Thr Phe Tyr Ser Phe Pro Pro Pro Asn Leu Thr 
305 310 315 320 

Leu Ser Glu Pro Glu Val Ser Glu Gly Thr Thr Val Ser lie Glu Cys 
325 330 335 

Gin Ala His Gly Glu Ala Val Val Thr Leu Asn Glu Val Pro Ala Glu 
340 345 350 

Pro Pro Ser Gin Arg Ala Gin Leu Lys Leu Asn Val Ser Ala Glu Asp 
355 360 365 

His Gly Arg Ser Phe Ser Cys Ser Ala Ala Leu Thr Val Ala Gly His 
370 375 380 

Val Leu Tyr Lys Asn Gin Thr Gin Val Leu Ser Val Leu Tyr Gly Pro 
385 390 395 400 



Arg Leu Asp Glu Arg Asp Cys Pro Gly Asn Trp Thr Trp Pro Glu Gly 
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405 410 415 

Ser His Gin Thr Leu Thr Cys Gin Ala Arg Gly Asn Pro Thr Pro Lys 
420 425 430 

Leu lie Cys Arg Arg Glu Gly Asp Gly Ala Leu Leu Pro Thr Gly Asp 
435 440 445 

Leu Gly Pro Val Lys Arg Glu lie Thr Gly Thr Tyr Gin Cys Gin Ala 
450 455 460 

Thr Ser Ser Arg Gly Val Ala Thr Arg Val Val Val Val Asn Val lie 
465 470 475 480 

His Asn Gin Asn Asn Met Val lie lie lie Pro Val Ala Ala Val Ala 
485 490 495 

lie Leu Gly Ser Val Gly Val Ala Ala Tyr lie Tyr Asn Tyr Gin Arg 
500 505 510 

Lys lie Gin Lys Tyr Glu Leu Gin Lys Ala Gin Glu Asn Ala Ala Met 
515 520 525 

Lys Leu Ser Thr Pro Ala Ser Pro Pro 
530 535 



<210> 75 
<211> 912 
<212> PRT 

<213> Oryctolagus cuniculus 
<400> 75 

Met Pro Gly Pro Ser Pro Gly Leu Arg Ala Leu Leu Gly Phe Trp Val 
15 10 15 

Ala Leu Gly Leu Gly lie Leu Arg Leu Ser Ala Val Ala Gin Glu Pro 
20 25 30 

Phe Trp Ala Asp Leu Gin Pro Arg Val Ala Leu Val Glu Arg Gly Gly 
35 40 45 

Ser Leu Trp Leu Asn Cys Ser Thr Asn Cys Pro Arg Pro Glu Arg Gly 
50 55 60 

Gly Leu Glu Thr Ser Leu Arg Arg Asn Gly Pro Glu Gly Leu Arg Trp 
65 70 75 80 

Arg Ala Arg Gin Leu Val Asp lie Arg Glu Pro Glu Thr Gin Pro Val 
85 90 95 

Cys Phe Phe Arg Cys Ala Ala Thr Leu Gin Ala Arg Gly Leu lie Arg 
100 105 110 

Thr Phe Gin Arg Pro Asp Arg Val Glu Leu Val Pro Leu Pro Pro Trp 
115 120 125 

Gin Pro Val Gly Glu Asn Phe Thr Leu Ser Cys Arg Val Pro Gly Ala 
130 135 140 



Gly Pro Arg Gly Ser Leu Thr Leu Thr Leu Leu Arg Gly Ala Gin Glu 
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155 



PCT/US02/34197 
160 



Leu lie Arg Arg 



Val Leu Thr Ala 
180 

Phe Ser Cys Arg 
195 

Phe Glu Asn Ser 
210 

Leu Asp Ser Pro 
225 

Gin Ser Leu Val 



Ala Gly Val His 
260 

Thr Leu Glu Gly 
275 

Glu Glu Gly lie 
290 

Arg Arg Glu Ser 
305 

Leu Leu Thr Leu 



Val Thr Cys Thr 
340 

Pro Ala Ala Ala 
355 

Glu Ser Asp Asp 
370 

Asp Gly Glu Thr 
385 

Tyr Ala Pro Arg 



Pro Glu Gly Pro 
420 

Thr Pro Ala Val 
435 

Leu Gly Leu Leu 
450 

Cys Thr Ala Ala 
465 



Ser Phe Ala Gly 
165 

Thr Val Leu Ala 



Ala Glu Leu Asp 
200 

Ser Ala Pro Arg 
215 

Arg Leu Leu Ala 
230 

Ser Cys Thr Leu 
245 

Leu Ala Leu Gly 



Asp Ala lie Val 
280 

Lys Gin Leu Val 
295 

Arg Glu Asn Val 
310 

Ser Glu Pro Ser 
325 

Ala Gly Ala Arg 



Pro Gly Gin Pro 
360 

Gly Arg Ser Phe 
375 

Leu Ser Lys Asn 
390 

Leu Asp Asp Ala 
405 

Glu Gin Thr Leu 



His Cys Ala Arg 
440 

Gly Pro Val Thr 
455 

Asn Val Gin Gly 
470 



Glu Pro Ala Arg 
170 

Arg Arg Glu Asp 
185 

Leu Arg Pro Gin 



Gin Leu Trp Thr 
220 

Pro Arg Val Leu 
235 

Asp Gly Leu Phe 
250 

Asp Lys Arg Leu 
265 

Ala Thr Ala Thr 



Cys Ala Val Thr 
300 

Thr Val Tyr Ser 
315 

Ala Pro Glu Gly 
330 

Ala Leu Val Thr 
345 

Ala Gin Leu Gin 



Phe Cys Asp Ala 
380 

Gly Ser Ala Glu 
395 

Asp Cys Pro Arg 
410 

Arg Cys Glu Ala 
425 

Ser Asp Gly Gly 



Arg Ala Leu Ala 
460 

Glu Ala Val Lys 
475 



Ala Arg Gly Ala 
175 

His Gly Ala Asn 
190 

Gly Leu Ala Leu 
205 

Tyr Ala Leu Pro 



Glu Val Asp Ser 
240 

Pro Ala Ser Glu 
255 

Asn Pro Glu Val 
270 

Ala Thr Ala Glu 
285 

Leu Gly Gly Glu 



Phe Pro Ala Pro 
320 

Lys Leu Val Thr 
335 

Leu Glu Gly Val 
350 

Phe Asn Ala Ser 
365 

Thr Leu Glu Leu 



Leu Arg Val Leu 
400 

Ser Trp Thr Trp 
415 

Arg Gly Asn Pro 
430 

Ala Val Leu Ala 
445 

Gly Thr Tyr Arg 



Asp Val Thr Leu 
480 
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Thr Val Glu Tyr 



Val Thr Trp Leu 
500 

Gly Val Pro Pro 
515 

Val lie Glu Gly 
530 

Arg Cys Glu Ala 
545 

Thr Val Glu Tyr 



Trp Thr Trp Val 
580 

Gly Lys Pro Gin 
5 95 

Glu Gly Leu Leu 
610 

Pro Ser Val Pro 
625 

Thr Asn Pro Leu 



Ser Pro Pro Gin 
660 

Leu Glu Gly Ala 
675 

Ser Pro Arg Val 
690 

Pro Arg Val Ser 
705 

Asn Ala His Gly 



Arg Pro Val Val 
740 

Pro Gly Gly Asn 
755 

Ala Gin lie Ser 
770 

Ser Ser Asn Asn 
785 



Ala Pro Ala Leu 
485 

Glu Gly Thr Glu 



Pro Ser Val Ser 
520 

Leu Leu Leu Val 
535 

lie Asn Ala Arg 
550 

Gly Pro Ser Phe 
565 

Glu Gly Ser Glu 



Pro Ser Val Gin 
600 

Leu Pro Leu Ala 
615 

Arg Asp Leu Ala 
630 

Gly Ser Ala Val 
645 

Met Asp Asp Ser 



Glu Ala Ala Gly 
680 

Arg Cys Ser Arg 
695 

Arg Glu Asp Ala 
710 

Ser Asp Ser Arg 
725 

Ala Glu Leu Ala 



Phe Thr Leu Thr 
760 

Trp Arg Ala Pro 
775 

Ser Thr Leu Ser 
790 



Asp Ser Val Gly 
490 

Ala Ser Leu Ser 
505 

Cys Val Arg Phe 



Ala Arg Glu His 
540 

Ala Leu Ala Lys 
555 

Glu Glu Arg Ser 
570 

Gin Leu Phe Ser 
585 

Cys Val Gly Ser 



Pro Leu Asn Pro 
620 

Pro Gly lie Tyr 
635 

Lys Thr Val Val 
650 

Thr Cys Pro Ser 
665 

Pro Ala Cys Ala 



Glu Gly Ala Pro 
700 

Gly Thr Tyr Leu 
715 

Thr Val Thr Val 
730 

Ala Ser Pro Ser 
745 

Cys Arg Ala Glu 



Pro Gly Ala Pro 
780 

Val Pro Gly Ala 
795 



Cys Pro Glu Arg 
495 

Cys Val Ala His 
510 

Arg Gin Ala Asp 
525 

Ala Gly Thr Tyr 



Asn Val Ala Val 
560 

Cys Pro Ser Asn 
575 

Cys Glu Val Glu 
590 

Glu Gly Ala Ser 
605 

Ser Pro Ser Asp 



Val Cys Asn Ala 
640 

Val Ser Ala Glu 
655 

Asp Gin Thr Trp 
670 

Arg Gly Arg Pro 
685 

Arg Pro Ala Arg 



Cys Val Ala Thr 
720 

Gly Val Glu Tyr 
735 

Gly Gly Val Arg 
750 

Ala Trp Pro Pro 
765 

Asn lie Gly Leu 



Met Gly Ser His 
800 
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Gly Gly Glu Tyr 



Arg lie Thr Val 
820 

Gly Ala Val Gly 
835 

Phe Tyr Val Gin 
850 

Glu Ala Glu Ser 
865 

Gly Ala Gly Ser 



Glu Ser Pro Ala 
900 



Glu Cys Glu Ala 
805 

Arg Val Ala Gly 



Gly Ala Val Leu 
840 

Ser Thr Ala Cys 
855 

Ser Gly Glu Ala 
870 

Gly Ala Glu Gly 
885 

Gly Gly Glu Val 
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Thr Asn Ala His 
810 

Pro Trp Leu Trp 
825 

Leu Ala Ala Gly 



Lys Lys Gly Glu 
860 

Val Cys Leu Asn 
875 

Gly Pro Glu Ala 
890 

Phe Ala lie Gin 
905 



Gly His Ala Arg 
815 

lie Ala Val Gly 
830 

Ala Gly Leu Ala 
845 

Tyr Asn Val Gin 



Gly Ala Gly Gly 
880 

Glu Asp Ser Ala 
895 

Leu Thr Ser Ala 
910 



<210> 76 
<211> 917 
<212> PRT 

<213> Mus musculus 
<400> 76 

Met Pro Gly Pro Ser Pro Gly Leu Arg Arg Ala Leu Leu Gly Leu Trp 
15 10 15 

Ala Ala Leu Gly Leu Gly lie Leu Gly lie Ser Ala Val Ala Leu Glu 
20 25 30 

Pro Phe Trp Ala Asp Leu Gin Pro Arg Val Ala Leu Val Glu Pro Gly 
35 40 45 

Gly Ser Leu Trp Leu Asn Cys Ser Thr Asn Cys Pro Arg Pro Glu Arg 
50 55 60 

Gly Gly Leu Glu Thr Ser Leu Arg Arg Asn Gly Thr Gin Arg Gly Leu 
65 70 75 80 

Arg Trp Leu Ala Arg Gin Leu Val Asp lie Arg Glu Pro Glu Thr Gin 
85 90 95 

Pro Val Cys Phe Phe Arg Cys Ala Arg Arg Thr Leu Gin Ala Arg Gly 
100 105 110 

Leu lie Arg Thr Phe Gin Arg Pro Asp Arg Val Glu Leu Val Pro Leu 
115 120 125 

Pro Ser Trp Gin Pro Val Gly Glu Asn Phe Thr Leu Ser Cys Arg Val 
130 135 140 

Pro Gly Ala Gly Pro Arg Ala Ser Leu Thr Leu Thr Leu Leu Arg Gly 
145 150 155 160 



Gly Gin Glu Leu lie Arg Arg Ser Phe Val Gly Glu Pro Pro Arg Ala 
165 170 175 
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Arg Gly Ala Met 
180 

Arg Val Asn Phe 
195 

Leu Gly Leu Phe 
210 

Ala Met Pro Pro 
225 

Val Asp Ser Glu 



Ala Pro Glu Ala 
260 

Pro Asn Val Thr 
275 

Thr Ala Ser Ala 
290 

Thr Leu Gly Gly 
305 

Ser Phe Pro Thr 



Gly Lys Met Val 
340 

Thr Leu Glu Gly 
355 

Gin Leu Asn Val 
370 

Ala Ala Leu Asp 
385 

Glu Leu Arg Val 



Arg Ser Trp Thr 
420 

Ala Arg Gly Asn 
435 

Gly Ala Val Leu 
450 

Ala Gly Thr Tyr 
465 

Lys Asp Val Thr 



Leu Thr Ala Arg 



Ser Cys Leu Ala 
200 

Ala Asn Ser Ser 
215 

His Ser Pro Ser 
230 

Arg Pro Val Thr 
245 

Gly Val Tyr Leu 



Leu Asp Gly Asp 
280 

Glu Gin Glu Gly 
295 

Glu Thr Arg Glu 
310 

Pro Leu Leu Thr 
325 

Thr lie Ser Cys 



lie Pro Ala Ala 
360 

Thr Lys Asn Asp 
375 

Val Asp Gly Glu 
390 

Leu Tyr Ala Pro 
405 

Trp Pro Glu Gly 



Pro Glu Pro Ser 
440 

Ala Leu Gly Leu 
455 

Arg Cys Thr Ala 
470 

Leu Thr Val Glu 
485 
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Val Leu Ala Arg 
185 

Glu Leu Asp Leu 



Ala Pro Arg Gin 
220 

Leu lie Ala Pro 
235 

Cys Thr Leu Asp 
250 

Ser Leu Gly Asp 
265 

Ser Leu Val Ala 



Thr Lys Gin Leu 
300 

Thr Gin Glu Asn 
315 

Leu Ser Glu Pro 
330 

Trp Ala Gly Ala 
345 

Val Pro Gly Gin 



Asp Lys Arg Gly 
380 

Thr Leu Arg Lys 
395 

Arg Leu Asp Asp 
410 

Pro Glu Gin Thr 
425 

Val His Cys Ala 



Leu Gly Pro Val 
460 

Val Asn Gly Gin 
475 

Tyr Ala Pro Ala 
490 



Arg Glu Asp His 
190 

Arg Pro His Gly 
205 

Leu Arg Thr Phe 



Arg Val Leu Glu 
240 

Gly Leu Phe Pro 
255 

Gin Arg Leu Asn 
270 

Thr Ala Thr Ala 
285 

Met Cys Val Val 



Leu Thr Val Tyr 
320 

Glu Ala Pro Glu 
335 

Arg Ala Leu Val 
350 

Pro Ala Glu Leu 
365 

Phe Phe Cys Asp 



Asn Gin Ser Ser 
400 

Leu Asp Cys Pro 
415 

Leu His Cys Glu 
430 

Arg Pro Glu Gly 
445 

Thr Arg Ala Leu 



Gly Gin Ala Val 
480 

Leu Asp Ser Val 
495 



Gly Cys Pro Glu His lie Thr Trp Leu Glu Gly Thr Glu Ala Ser Leu 
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500 505 510 

Ser Cys Val Ala Pro Gly Val Pro Pro Pro Ser Val Ser Cys Val Arg 
515 520 525 

Ser Gly Lys Glu Glu Val Met Glu Gly Pro Leu Arg Val Ala Arg Glu 
530 535 540 

His Ala Gly Thr Tyr Arg Cys Glu Ala lie Asn Ala Arg Gly Ser Ala 
545 550 555 560 

Ala Lys Asn Val Ala Val Thr Val Glu Tyr Gly Pro Ser Phe Glu Glu 
565 570 575 

Leu Gly Cys Pro Ser Asn Trp Thr Trp Val Glu Gly Ser Gly Lys Leu 
580 585 590 

Phe Ser Cys Glu Val Asp Gly Lys Pro Glu Pro Arg Val Glu Cys Val 
595 600 605 

Gly Ser Glu Gly Ala Ser Glu Gly lie Val Leu Pro Leu Val Ser Ser 
610 615 620 

Asn Ser Gly Pro Arg Asn Ser Met Thr Pro Gly Asn Leu Ser Pro Gly 
625 630 635 640 

lie Tyr Leu Cys Asn Ala Thr Asn Arg His Gly Ser Thr Val Lys Thr 
645 650 655 

Val Val Val Ser Ala Glu Ser Pro Pro Gin Met Asp Glu Ser Ser Cys 
660 665 670 

Pro Ser His Gin Thr Trp Leu Glu Gly Ala Glu Ala Thr Ala Leu Ala 
675 680 685 

Cys Ser Ala Arg Gly Arg Pro Ser Pro Arg Val His Cys Ser Arg Glu 
690 695 700 

Gly Ala Ala Arg Leu Glu Arg Leu Gin Val Ser Arg Glu Asp Ala Gly 
705 710 715 720 

Thr Tyr Arg Cys Val Ala Thr Asn Ala His Gly Thr Asp Ser Arg Thr 
725 730 735 

Val Thr Val Gly Val Glu Tyr Arg Pro Val Val Ala Glu Leu Ala Ala 
740 745 750 

Ser Pro Pro Ser Val Arg Pro Gly Gly Asn Phe Thr Leu Thr Cys Arg 
755 760 765 

Ala Glu Ala Trp Pro Pro Ala Gin lie Ser Trp Arg Ala Pro Pro Gly 
770 775 780 

Ala Leu Asn Leu Gly Leu Ser Ser Asn Asn Ser Thr Leu Ser Val Ala 
785 790 795 800 

Gly Ala Met Gly Ser His Gly Gly Glu Tyr Glu Cys Ala Ala Thr Asn 
805 810 815 



Ala His Gly Arg His Ala Arg Arg lie Thr Val Arg Val Ala Gly Pro 
820 825 830 
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Trp Leu Trp Val Ala Val Gly Gly 
835 840 

Ala Ala Gly Ala Gly Leu Ala Phe 
850 855 

Lys Gly Glu Tyr Asn Val Gin Glu 
865 870 

Cys Leu Asn Gly Ala Gly Gly Thr 
885 

Thr Pro Gly Thr Ala Glu Ser Pro 
900 

Gin Leu Thr Ser Ser 
915 



Ala Ala Gly Gly Ala Ala Leu Leu 
845 

Tyr Val Gin Ser Thr Ala Cys Lys 
860 

Ala Glu Ser Ser Gly Glu Ala Val 
875 880 

Pro Gly Ala Glu Gly Gly Ala Glu 
890 895 

Ala Asp Gly Glu Val Phe Ala He 
905 910 



<210> 77 
<211> 548 
<212> PRT 

<213> Mus musculus 
<400> 77 

Met Lys Met Leu Leu Leu Gly Val Trp Thr Leu Leu Ala Leu He Pro 
15 10 15 

Cys Pro Gly Ala Ala Glu Glu Leu Phe Gin Val Ser Val His Pro Asn 
20 25 30 

Glu Ala Leu Val Glu Phe Gly His Ser Leu Thr Val Asn Cys Ser Thr 
35 40 45 

Thr Cys Pro Asp Pro Gly Pro Ser Gly He Glu Thr Phe Leu Lys Lys 
50 55 60 

Thr Gin Leu Ser Lys Gly Ser Gin Trp Lys Glu Phe Leu Leu Glu Asp 
65 70 75 80 

He Thr Glu Asp Leu Val Leu Gin Cys Phe Phe Ser Cys Ala Gly Glu 
85 90 95 

Gin Lys Asp Thr Val Leu Ala He Thr Met Tyr Gin Pro Pro Glu Gin 
100 105 110 

Val He Leu Asp Leu Gin Pro Glu Trp Val Ala Val Asp Glu Ala Phe 
115 120 125 

Thr Val Thr Cys His Val Pro Ser Val Ala Pro Leu Gin Ser Leu Thr 
130 135 140 

Leu Thr Leu Leu Gin Gly Asp Gin Glu Leu His Arg Lys Asp Phe Leu 
145 150 155 160 

Ser Leu Ser Leu Val Ser Gin Arg Ala Glu Val Thr Ala Thr Val Arg 
165 170 175 



Ala His Arg Asp Asn Asp Arg Arg Asn Phe Ser Cys Arg Ala Glu Leu 
180 185 190 
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Asp Leu Ser Pro 
195 

Lys Gin Leu Arg 
210 

Pro Ser Leu Leu 
225 

Thr Arg Val Phe 



Asp Gin Glu Leu 
260 

Ala Ser Ala Thr 
275 

Cys Leu Val Ser 
290 

Tyr Val Tyr Ser 
305 

Pro Leu Ala Gly 



Thr Ser Pro Ser 
340 

Ala Pro Gly Lys 
355 

Asp Gly Arg Thr 
370 

Arg Leu Val Arg 
385 

Arg Phe Gin Glu 



Met His Gin Met 
420 

Leu Val Cys Val 
435 

Lys Ala Thr Gin 
450 

Pro Leu Gly Ser 
465 

Pro Glu Gly lie 



Leu Gly Met Ala 
500 



His Gly Gly Gly 
200 

lie Phe Glu Phe 
215 

Glu Val Gly Lys 
230 

Pro Ala Gin Glu 
245 

Ser Pro Phe Ser 



lie Gin Ala Met 
280 

Leu Gly Pro Val 
295 

Phe Pro Pro Pro 
310 

Thr Asp Val Asn 
325 

Pro Thr Leu Arg 



Pro Ala Trp Leu 
360 

Leu Ser Cys Glu 
375 

Thr Thr Glu Ser 
390 

Ser Arg Cys Pro 
405 

Leu Ala Cys lie 



Trp Asn Gly Met 
440 

Asn His Thr Gly 
455 

Val Ser Lys Asp 
470 

Ser Ser Ser Thr 
485 

Val lie Thr Val 



Leu Phe His Gly 



Ser Gin Asn Pro 
220 

Ala Glu lie Val 
235 

Ala Val Phe Arg 
250 

Ser Trp Arg Glu 
265 

Glu Thr Gly Asp 



Glu Gin Lys Thr 
300 

lie Leu Glu lie 
315 

Val Thr Cys Ser 
330 

Leu Gin Gly Ser 
345 

Leu Phe Thr Ala 



Ala Ser Leu Glu 
380 

Gin Leu His Val 
395 

Gly Asn Gin lie 
410 

Pro Glu Gly Asn 
425 

lie Phe Asp Leu 



Thr Tyr Cys Cys 
460 

He Thr He He 
475 

He Phe He He 
490 

Ala Leu Tyr Leu 
505 



Ser Ser Ala Thr 
205 

Gin He Trp Val 



Ser Cys Glu Val 
240 

Met Phe Leu Glu 
255 

Asp Ala Ala Trp 
270 

Gin Glu Leu Thr 
285 

Arg Lys Pro Val 



Glu Asp Ala Tyr 
320 

Gly His Val Leu 
335 

Leu Asn His Ser 
350 

Arg Glu Glu Asp 
365 

Val Gin Gly Gin 



Leu Tyr Lys Pro 
400 

Trp Val Glu Gly 
415 

Pro Thr Pro Val 
430 

Asp Val Pro Gin 
445 

Thr Ala Thr Asn 



Val Gin Gly Leu 
480 

He He Phe Thr 
495 

Asn Tyr Gin Pro 
510 
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Cys Lys Gly Asn Ser Arg Lys Arg Met His Arg Pro Arg Glu Gin Ser 
515 520 525 

Lys Gly Glu Glu Ser Gin Phe Ser Asp lie Arg Ala Glu Glu Cys His 
530 535 540 

Ala His Leu Cys 
545 



<210> 78 
<211> 548 
<212> PRT 

<213> Rattus norvegicus 
<400> 78 

Met Lys Met Leu Leu Leu Gly lie Trp Thr Leu Leu Ala Leu lie Pro 
1 5 10 15 

Cys Pro Gly Thr Thr Glu Val Leu Phe Gin Val Ser Val His Pro Asn 
20 25 30 

Gin Ala Leu Val Glu Phe Gly His Ser Leu Thr lie Asn Cys Ser Thr 
35 40 45 

Thr Cys Pro Asp Pro Gly Pro Ser Gly lie Glu Thr Phe Leu Lys Lys 
50 55 60 

Thr Gin Leu Ser Lys Gly Ser Gin Trp Lys Glu Phe Leu Leu Glu Gly 
65 70 75 80 

lie Thr Glu Asn Ser Val Leu Gin Cys Phe Phe Ser Cys Ala Gly Val 
85 90 95 

Gin Lys Asp Thr Ala Leu Asp lie Thr Met Tyr Gin Pro Pro Glu Gin 
100 105 110 

Val lie Leu Asp Leu Gin Pro Glu Trp Val Ala lie Asp Glu Ala Phe 
115 120 125 

Thr Val Lys Cys His Val Pro Ser Val Ala Pro Leu Gin Ser Leu Thr 
130 135 140 

Leu Thr Leu Leu Gin Gly Asp Gin Glu Leu His Arg Lys Asp Phe Leu 
145 150 155 160 

Ser Leu Ser Leu Val Ser Gin Arg Ala Glu Val Thr Val Asn Val Arg 
165 170 175 

Ala Gin Arg Glu Asn Asp Arg His Asn Phe Ser Cys Arg Ala Glu Leu 
180 185 190 

Asp Leu Ser Pro His Gly Gly Gly Leu Phe His Gly Ser Ser Ala Thr 
195 200 205 

Lys Gin Leu Arg lie Phe Glu Phe Ser Gin Asn Pro Gin lie Leu Val 
210 215 220 



Pro Ser Leu Leu Glu Val Gly Met Ala Glu Thr Met Ser Cys Glu Val 
225 230 235 240 



WO 03/064992 PCT/US02/34197 

82/119 

Val Arg Val Phe Pro Ala Gin Glu Ala Val Phe Arg Met Phe Leu Glu 
245 250 255 

Asp Gin Glu Leu Ser Pro Phe Ser Ser Trp Lys Gly Asp Ala Ala Trp 
260 265 270 

Ala Ser Ala Thr lie Gin Ala Met Glu Thr Gly Asp Gin Glu Leu Thr 
275 280 285 

Cys Leu Val Ser Val Gly Pro Val Glu Gin Lys Ala Arg Lys Pro Val 
290 295 300 

His Val Tyr Ser Phe Pro Pro Pro Val Leu Glu lie Glu Asp Ala Tyr 
305 310 315 320 

Pro Gin Ala Gly Thr Asp Val Asn Val Thr Cys Ser Gly His Val Leu 
325 330 335 

Thr Ser Pro Ser Pro Thr Leu Arg Leu Gin Gly Ser Leu Asn Leu Ser 
340 345 350 

Ala Pro Gly Glu Pro Ala Trp Leu Arg Phe Thr Ala Arg Glu Glu Asp 
355 360 365 

Asp Gly Arg Thr Leu Ser Cys Glu Ala Ser Leu Val Val Gin Gly Gin 
370 375 380 

Arg Leu Val Lys Thr Thr Lys lie Gin Leu His Val Leu Tyr Lys Pro 
385 390 395 400 

Arg Phe Gin Glu Ser Asp Cys Pro Gly Asn Gin lie Trp Val Glu Gly 
405 410 415 

Met Asp Gin Met Leu Ala Cys lie Pro Glu Gly Asn Pro lie Pro Ala 
420 425 430 

Leu Val Cys lie Trp Asn Gly Met Thr Phe Asp Leu Glu Val Pro Gin 
435 440 445 

Lys Ala Thr Gin Asn His Thr Gly Thr Tyr Ser Cys Thr Ala Thr Asn 
450 455 460 

Ser Leu Gly Ser Val Ser Lys Asp lie Ala Val Leu Val Gin Gly Leu 
465 470 475 480 

His Glu Gly lie Ser Ser Ser Thr lie Phe lie lie lie He Phe Thr 
485 490 495 

Leu Gly Met Ala Val He Thr He Ala Leu Tyr Leu Asn Tyr Gin Pro 
500 505 510 

Cys Lys Arg Asn Gly Arg Lys Arg Thr His Arg Gin Lys Glu Gin Asn 
515 520 525 

Lys Gly Gly Glu Arg Gin Phe Ser Asp He Gin Ala Glu Glu Cys His 
530 535 540 



Ala His Leu Cys 
545 
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<210> 79 

<211> 396 

<212> PRT 

<213> Mus musculus 

<400> 79 

Met Gly Ala Pro Ser Ala Leu Pro Leu Leu Leu Leu Leu Ala Cys Ser 
15 10 15 

Trp Ala Pro Gly Gly Ala Asn Leu Ser Gin Asp Asp Ser Gin Pro Trp 
20 25 30 

Thr Ser Asp Glu Thr Val Val Ala Gly Gly Thr Val Val Leu Lys Cys 
35 40 45 

Gin Val Lys Asp His Glu Asp Ser Ser Leu Gin Trp Ser Asn Pro Ala 
50 55 60 

Gin Gin Thr Leu Tyr Phe Gly Glu Lys Arg Ala Leu Arg Asp Asn Arg 
65 70 75 80 

lie Gin Leu Val Ser Ser Thr Pro His Glu Leu Ser lie Ser lie Ser 
85 90 95 

Asn Val Ala Leu Ala Asp Glu Gly Glu Tyr Thr Cys Ser lie Phe Thr 
100 105 110 

Met Pro Val Arg Thr Ala Lys Ser Leu Val Thr Val Leu Gly lie Pro 
115 120 125 

Gin Lys Pro lie lie Thr Gly Tyr Lys Ser Ser Leu Arg Glu Lys Glu 
130 135 140 

Thr Ala Thr Leu Asn Cys Gin Ser Ser Gly Ser Lys Pro Ala Ala Gin 
145 150 155 160 

Leu Thr Trp Arg Lys Gly Asp Gin Glu Leu His Gly Asp Gin Thr Arg 
165 170 175 

lie Gin Glu Asp Pro Asn Gly Lys Thr Phe Thr Val Ser Ser Ser Val 
180 185 190 

Ser Phe Gin Val Thr Arg Glu Asp Asp Gly Ala Asn lie Val Cys Ser 
195 200 205 

Val Asn His Glu Ser Leu Lys Gly Ala Asp Arg Ser Thr Ser Gin Arg 
210 215 220 

lie Glu Val Leu Tyr Thr Pro Thr Ala Met lie Arg Pro Glu Pro Ala 
225 230 235 240 

His Pro Arg Glu Gly Gin Lys Leu Leu Leu His Cys Glu Gly Arg Gly 
245 250 255 

Asn Pro Val Pro Gin Gin Tyr Val Trp Val Lys Glu Gly Ser Glu Pro 
260 265 270 

Pro Leu Lys Met Thr Gin Glu Ser Ala Leu lie Phe Pro Phe Leu Asn 
275 280 285 



Lys Ser Asp Ser Gly Thr Tyr Gly Cys Thr Ala Thr Ser Asn Met Gly 



WO 03/064992 

290 295 



84/119 



PCT/US02/34197 

300 



Ser Tyr Thr Ala Tyr Phe Thr Leu 
305 310 

Pro Ser Ser Ser Ser Thr Tyr His 
325 

Phe lie Val Phe Leu Leu Leu lie 
340 

Leu lie Arg His Lys Gly Thr Tyr 
355 360 

Asp Asp Ala Pro Asp Ala Asp Thr 
370 375 

Gin Ser Gly Gly Asp Asp Lys Lys 
385 390 



Asn Val Asn Asp Pro Ser Pro Val 
315 320 

Ala lie He Gly Gly He Val Ala 
330 335 

Leu Leu He Phe Leu Gly His Tyr 
345 350 

Leu Thr His Glu Ala Lys Gly Ser 
365 

Ala He He Asn Ala Glu Gly Gly 
380 

Glu Tyr Phe He 
395 



<210> 80 
<211> 662 
<212> PRT 

<213> Homo sapiens 
<400> 80 

Met Glu Ser Lys Thr Glu Lys Trp Met Glu Arg He His Leu Asn Val 
15 10 15 

Ser Glu Arg Pro Phe Pro Pro His He Gin Leu Pro Pro Glu He Gin 
20 25 30 

Glu Ser Gin Glu Val Thr Leu Thr Cys Leu Leu Asn Phe Ser Cys Tyr 
35 40 45 

Gly Tyr Pro He Gin Leu Gin Trp Leu Leu Glu Gly Val Pro Met Arg 
50 55 60 

Gin Ala Ala Val Thr Ser Thr Ser Leu Thr He Lys Ser Val Phe Thr 
65 70 75 80 

Arg Ser Glu Leu Lys Phe Ser Pro Gin Trp Ser His His Gly Lys He 
85 90 95 

Val Thr Cys Gin Leu Gin Asp Ala Asp Gly Lys Phe Leu Ser Asn Asp 
100 105 110 

Thr Val Gin Leu Asn Val Lys His Thr Pro Lys Leu Glu He Lys Val 
115 120 125 

Thr Pro Ser Asp Ala He Val Arg Glu Gly Asp Ser Val Thr Met Thr 
130 135 140 

Cys Glu Val Ser Ser Thr Asn Pro Glu Tyr Thr Thr Val Ser Trp Leu 
145 150 155 160 

Lys Asp Gly Thr Ser Leu Lys Lys Gin Asn Thr Phe Thr Leu Asn Leu 
165 170 175 



Arg Glu Val Thr Lys Asp Gin Ser Gly Lys Tyr Cys Cys Gin Val Ser 
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180 

Asn Asp Val Gly 
195 

Tyr Ala Pro Glu 
210 

Glu Gly Ser Gin 
225 

Pro Thr Asn Tyr 



Thr Glu Glu Lys 
260 

Thr Tyr Ser Cys 
275 

Pro Gly Ala Glu 
290 

Val lie Gin Asn 
305 

Ser Cys Asn Tyr 



Lys Pro His Gly 
340 

Gin Lys Val Gly 
355 

Ser Trp Cys Ser 
370 

Pro Arg Asp Val 
385 

Ser Gly Asn Ser 



Lys Glu Val Gin 
420 

Glu Ser Gin Leu 
435 

Tyr Ser Cys Trp 
450 

Trp Thr Leu Glu 
465 

Ser Pro Gly Asp 



Glu Ser Asp Ala 
500 



Pro Gly 

Pro Ser 

Val Glu 
230 

Thr Trp 
245 

Val His 

Val Ala 

Leu Asp 

Pro Met 
310 

Asn Ser 
325 

Ala Trp 

Trp Asp 

Trp Ala 

Arg Val 
390 

Val Ser 
405 

Phe Phe 
Asn Phe 



Val Asn 



Val Leu 
470 

Gin Val 
485 

Asn Pro 



Arg Ser 
200 

Thr Val 
215 

Phe Leu 

Tyr His 

lie Pro 

Glu Asn 
280 

Val Gin 
295 

Pro lie 

Ser Asn 

Glu Glu 

Asn Thr 
360 

Ser Pro 
375 

Arg Lys 

Leu Gin 

Trp Glu 

Asp Ser 
440 

Asn Ser 
455 

Tyr Ala 
Met Glu 
Pro Val 



85/119 
185 

Glu Glu 

Gin He 

Cys Met 

Asn Gly 
250 

Lys He 
265 

He Leu 
Tyr Pro 



Arg Glu 

Pro Ser 
330 

Pro Ser 
345 

Thr He 

Val Ala 

He Lys 

Cys Asp 
410 

Lys Asn 
425 

He Ser 

He Gly 

Pro Arg 

Gly Lys 
490 

Ser His 
505 



Val Phe 

Leu His 
220 

Ser Leu 
235 

Lys Glu 
Leu Pro 



Gly Thr 

Pro Lys 
300 

Gly Asp 
315 

Val Thr 

Leu Gly 

Ala Cys 

Leu Asn 
380 

Pro Leu 
395 

Phe Ser 

Gly Arg 

Pro Glu 

Gin Thr 
460 

Arg Leu 
475 

Ser Ala 
Tyr Thr 



190 

Leu Gin 
205 

Ser Pro 

Ala Asn 

Met Gin 

Trp His 
270 

Gly Gin 
285 

Lys Val 

Thr Val 

Arg Tyr 

Val Leu 
350 

Ala Arg 
365 

Val Gin 

Ser Glu 

Ser Ser 

Leu Leu 
430 

Asp Ala 
445 

Ala Ser 

Arg Val 

Thr Leu 

Trp Phe 
510 



Val Gin 

Ala Val 

Pro Leu 
240 

Gly Arg 
255 

Ala Gly 

Arg Gly 

Thr Thr 

Thr Leu 
320 

Glu Trp 
335 

Lys He 

Cys Asn 

Tyr Ala 

He His 
400 

His Pro 
415 

Gly Lys 

Gly Ser 

Lys Ala 

Ser Met 
480 

Arg Cys 
495 

Asp Trp 
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Asn Asn Gin Ser Leu 
515 

Val Lys Val Gin His 
530 

Val Gly Lys Gly Arg 
545 

Pro Glu Thr lie Gly 
565 

Ala lie Leu lie Leu 
580 

Ala Glu Ser Ser Glu 
595 

Val Thr Tyr Ser Ala 
610 

Val lie Pro Asp Phe 
625 

lie Gin Phe Gly Val 
645 

Tyr Val lie Leu Lys 
660 



Pro Tyr His Ser Gin Lys 
520 

Ser Gly Ala Tyr Trp Cys 
535 

Ser Pro Leu Ser Thr Leu 
550 555 

Arg Arg Val Ala Val Gly 
570 

Ala lie Cys Gly Leu Lys 
5 85 

Met Gin Arg Pro Pro Arg 
600 

Leu His Lys Arg Gin Val 
615 

Pro Glu Asp Glu Gly lie 
630 635 

Gly Glu Arg Pro Gin Ala 
650 

His 



Leu Arg Leu Glu Pro 
525 

Gin Gly Thr Asn Ser 
540 

Thr Val Tyr Tyr Ser 
560 

Leu Gly Ser Cys Leu 
575 

Leu Gin Arg Arg Asp 
590 

Thr Cys Asp Asp Thr 
605 

Gly Asp Tyr Glu Asn 
620 

His Tyr Ser Glu Leu 
640 

Gin Glu Asn Val Asp 
655 



<210> 81 
<211> 505 
<212> PRT 

<213> Pan troglodytes 
<400> 81 

Gin Thr Ser Val Ser Pro Pro Lys Val lie Leu Pro Arg Gly Gly Ser 
1 5 10 15 

Val Gin Val Thr Cys Ser Thr Ser Cys Asp Gin Pro Asp Leu Leu Gly 
20 25 30 

lie Glu Thr Pro Leu Pro Lys Lys Glu Leu Leu Leu Gly Gly Asn Asn 
35 40 45 

Trp Lys Val Tyr Glu Leu Ser Asn Val Gin Glu Asp Ser Gin Pro Met 
50 55 60 

Cys Tyr Ser Asn Cys Pro Asp Gly Gin Ser Thr Ala Lys Thr Phe Leu 
65 70 75 80 

Thr Val Tyr Trp Thr Pro Glu Arg Val Glu Leu Ala Pro Leu Pro Ser 
85 90 95 

Trp Gin Pro Val Gly Lys Asp Leu Thr Leu Arg Cys Gin Val Glu Gly 
100 105 110 



Gly Ala Pro Arg Ala Asn Leu Thr Val Val Leu Leu Arg Gly Glu Lys 
115 120 125 
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Glu Leu Lys Arg 
130 

Thr Val Leu Val 
145 

Thr Glu Leu Asp 



Ser Ala Pro His 
180 

Gin Leu Val Ser 
195 

Val Cys Ser Leu 
210 

Leu Ala Leu Gly 
225 

Asp Ser Phe Ser 



Gly Thr Gin Arg 
260 

Glu Thr Leu Gin 
275 

lie Leu Thr Lys 
290 

Cys Glu Ala His 
305 

Gin Pro Val Gly 



Asp Asn Gly Arg 
340 

Gin Leu lie His 
355 

Pro Arg Leu Asp 
370 

Asn Ser Gin Gin 
385 

Glu Leu Lys Cys 



Ser Val Thr Val 
420 

Arg Ser Thr Gin 
435 



Glu Pro Ala Val 
135 

Glu Arg Asp His 
150 

Leu Arg Pro Gin 
165 

Gin Leu Gin Thr 



Pro Arg Val Leu 
200 

Asp Gly Leu Phe 
215 

Asp Gin Arg Leu 
230 

Ala Lys Ala Ser 
245 

Leu Thr Cys Ala 



Thr Val Thr He 
280 

Pro Glu Val Ser 
295 

Pro Arg Ala Lys 
310 

Pro Arg Val Gin 
325 

Ser Phe Ser Cys 



Lys Asn Gin Thr 
360 

Glu Arg Asp Cys 
375 

Thr Pro Met Cys 
390 

Leu Lys Asp Gly 
405 

Thr Arg Asp Leu 



Gly Glu Val Thr 
440 



Gly Glu Pro Ala 
14 0 

His Gly Ala Asn 
155 

Gly Leu Gin Leu 
170 

Phe Val Leu Pro 
185 

Glu Val Asp Thr 



Pro Val Leu Glu 
220 

Asn Pro Thr Val 
235 

Val Ser Val Thr 
250 

Val He Leu Gly 
265 

Tyr Ser Phe Pro 



Glu Gly Thr Glu 
300 

Val Thr Leu Asn 
315 

Leu Leu Leu Lys 
330 

Ser Ala Thr Leu 
345 

Arg Glu Leu Arg 



Pro Gly Asn Trp 
380 

Gin Ala Ser Gly 
395 

Thr Phe Pro Leu 
410 

Glu Gly Thr Tyr 
425 

Arg Lys Val Thr 



Glu Val Thr Thr 



Phe Ser Cys Arg 
160 

Phe Glu Asn Thr 
175 

Ala Thr Pro Pro 
190 

Gin Gly Thr Val 
205 

Ala Gin Val His 



Thr Tyr Gly Asn 
240 

Ala Glu Asp Glu 
255 

Asn Gin Ser Arg 
270 

Ala Pro Asn Val 
285 

Val Thr Val Lys 



Gly Val Pro Ala 
320 

Ala Thr Pro Glu 
335 

Glu Val Ala Gly 
350 

Val Leu Tyr Gly 
365 

Thr Trp Pro Glu 



Asn Pro Leu Pro 
400 

Pro Val Gly Glu 
415 

Leu Cys Arg Ala 
430 

Val Asn Val Leu 
445 
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Ser Pro Arg Tyr Glu He Val He lie Thr Val Val Ala Ala Ala Val 
450 455 460 

He Met Gly Thr Ala Gly Leu Ser Thr Tyr Leu Tyr Asn Arg Gin Arg 
465 470 475 480 

Lys He Arg Lys Tyr Arg Leu Gin Gin Ala Gin Lys Gly Thr Pro Met 
485 490 495 

Lys Pro Asn Thr Gin Ala Thr Pro Pro 
500 505 



<210> 82 
<211> 447 
<212> PRT 

<213> Mus musculus 
<220> 

<221> M0D_RES 
<222> (12) 

<22 3> Any amino acid 
<220> 

<221> M0D_RES 

<222> (77) . . (80) 

<22 3> Any amino acid 

<220> 

<221> MOD_RES 
<222> (145) . . (147) 
<223> Any amino acid 

<220> 

<221> M0D_RES 
<222> (155) . . (163) 
<223> Any amino acid 

<220> 

<221> M0D_RES 
<222> (268) . . (269) 
<223> Any amino acid 

<220> 

<221> M0D_RES 

<222> (279) 

<223> Any amino acid 

<220> 

<221> M0D_RES 

<222> (282) 

<223> Any amino acid 

<400> 82 

Glu Asp Ser Gin Pro Met Cys Tyr Ser Asn Cys Xaa Asp Gly Gin Ser 
15 10 15 

Thr Ala Lys Thr Phe Leu Thr Val Tyr Trp Thr Pro Glu Arg Val Glu 
20 25 30 



Leu Ala Pro Leu Pro Ser Trp Gin Pro Val Gly Lys Asn Leu Thr Leu 
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Arg Cys Gin Val 
50 

Leu Leu Arg Gly 
65 

Pro Ala Glu Val 



Ala Asn Phe Ser 
100 

Glu Leu Phe Glu 
115 

Leu Pro Ala Thr 
130 

Xaa Xaa Xaa Gly 
145 

Xaa Xaa Xaa Gin 



Thr Val Thr Tyr 
180 

Val Thr Ala Glu 
195 

Leu Gly Asn Gin 
210 

Phe Pro Ala Pro 
225 

Thr Glu Val Thr 



Leu Asn Gly Val 
260 

Leu Lys Ala Thr 
275 

Thr Leu Glu Val 
290 

Leu Arg Val Leu 
305 

Asn Trp Thr Trp 



Trp Gly Asn Pro 
340 

Pro Leu Pro lie 
355 



Glu Gly Gly Ala 
55 

Glu Lys Glu Leu 
70 

Thr Thr Thr Val 
85 

Cys Arg Thr Glu 



Asn Thr Ser Ala 
120 

Pro Pro Gin Leu 
135 

Thr Val Val Cys 
150 

Val His Leu Ala 
165 

Gly Asn Asp Ser 



Asp Glu Gly Thr 
200 

Ser Gin Glu Thr 
215 

Asn Val lie Leu 
230 

Val Lys Cys Glu 
245 

Pro Ala Gin Pro 



Pro Glu Xaa Asn 
280 

Ala Gly Gin Leu 
295 

Tyr Gly Pro Arg 
310 

Pro Glu Asn Ser 
325 

Leu Pro Glu Leu 



Gly Glu Ser Val 
360 



Pro Arg Ala Asn 
60 

Lys Arg Glu Pro 
75 

Leu Val Arg Arg 
90 

Leu Asp Leu Arg 
105 

Pro Tyr Gin Leu 



Val Ser Pro Arg 
140 

Ser Leu Xaa Xaa 
155 

Leu Gly Asp Gin 
170 

Phe Ser Ala Lys 
185 

Gin Arg Leu Thr 



Leu Gin Thr Val 
220 

Thr Lys Pro Glu 
235 

Ala His Pro Arg 
250 

Leu Gly Pro Xaa 
265 

Gly Xaa Ser Phe 



lie His Lys Asn 
300 

Leu Asp Glu Arg 
315 

Gin Gin Thr Pro 
330 

Lys Cys Leu Lys 
345 

Thr Val Thr Arg 



Leu Thr Val Val 



Xaa Xaa Xaa Xaa 
80 

Asp His His Gly 
95 

Pro Gin Gly Leu 
110 

Gin Thr Phe Val 
125 

Val Leu Glu Val 



Xaa Xaa Xaa Xaa 
160 

Arg Leu Asn Pro 
175 

Ala Ser Val Ser 
190 

Cys Ala Val lie 
205 

Thr lie Tyr Ser 



Val Ser Glu Gly 
240 

Ala Lys Val Thr 
255 

Xaa Gin Leu Leu 
270 

Ser Cys Ser Ala 
285 

Gin Thr Arg Glu 



Asp Cys Pro Gly 
320 

Met Cys Gin Ala 
335 

Asp Gly Thr Phe 
350 

Asp Leu Glu Gly 
365 
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Thr Tyr Leu Cys Arg Ala Arg Ser 
370 375 

Val Thr Val Asn Val Leu Ser Pro 
385 390 

Val Val Ala Ala Ala Val lie Met 
405 

Leu Tyr Asn Arg Gin Arg Lys lie 
420 

Gin Lys Gly Thr Pro Met Lys Pro 
435 440 



Thr Gin Gly Glu Val Thr Arg Glu 
380 

Arg Tyr Glu He Val He He Thr 
395 400 

Gly Thr Ala Gly Leu Ser Thr Tyr 
410 415 

Lys Lys Tyr Arg Leu Gin Gin Ala 
425 430 

Asn Thr Gin Ala Thr Pro Pro 
445 



<210> 83 
<211> 528 
<212> PRT 

<213> Canis familiaris 
<400> 83 

Ala Pro Ala Leu Pro Arg Leu Pro Ala Leu Leu Ala Leu Leu Gly Ala 
15 10 15 

Leu Leu Pro Gly Leu Gly Gly Ala Gin Thr Ser Val Asp Pro Ala Glu 
20 25 30 

Ala He He Leu Arg Gly Gly Ser Val Gin Val Asn Cys Ser Thr Ser 
35 40 45 

Cys Asn Gin Thr Ser He Phe Gly Leu Glu Thr Leu Leu Thr Lys Thr 
50 55 60 

Glu Val Thr Ser Gly Asp Asn Trp Val Leu Phe Glu Leu Thr Asp Val 
65 70 75 80 

Gin Glu Asp Ser Lys Leu He Cys Phe Ser Asn Cys His Asp Glu Thr 
85 90 95 

Met Ala Pro He Asp Leu Thr Val Tyr Trp Phe Pro Glu Arg Val Glu 
100 105 110 

Leu Ala Pro Leu Pro Arg Trp Gin Pro Val Gly Glu Asn Leu Thr Met 
115 120 125 

Thr Cys Gin Val Ala Gly Gly Ala Pro Arg Thr Asn Leu Thr Val Val 
130 135 140 

Leu Leu Arg Gly Glu Glu Glu Leu Ser Arg Gin Pro Ala Val Gly Glu 
145 150 155 160 

Pro Ala Glu Val Thr Phe Thr Val Ala Val Gly Arg Glu Asp His Leu 
165 170 175 

Ala Asn Phe Ser Cys Arg Thr Asp Leu Asp Leu Arg His Arg Gly Leu 
180 185 190 



Gly Leu Phe Gin Asn Ser Ser Ala Pro Arg Gin Leu Gin Thr Phe Val 
195 200 205 
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Leu Pro Glu Thr 
210 

Gly Thr Gin Trp 
225 

Ser Glu Ala Gin 



Thr Val Leu Tyr 
260 

Ala Asn Pro Glu 
275 

Leu Gly Asp Glu 
290 

Phe Pro Ala Pro 
305 

Thr Thr Val Thr 



Leu Ser Gly Leu 
340 

Leu Asn Ala Ser 
355 

Ala Leu Glu Val 
370 

Leu His Val Leu 
385 

Asn Trp Thr Trp 



Trp Gly Asn Pro 
420 

Ala Leu Leu Pro 
435 

Gly Thr Tyr Leu 
450 

Glu Val Val lie 
465 

lie Leu Val Thr 



Tyr Leu Tyr Asn 
500 

Ala Gin Glu Ala 
515 



Pro Pro Arg Leu 
215 

Ser Val Asp Cys 
230 

Val His Leu Ala 
245 

Lys Lys Asp Ser 



Asp Glu Gly Thr 
280 

Asn Arg Arg Trp 
295 

Asn Leu Thr Leu 
310 

Val Glu Cys Glu 
325 

Pro Ser Gly Leu 



Ala Ala Asp Asn 
360 

Ala Gly His Met 
375 

Tyr Gly Pro Arg 
390 

Glu Glu Gly Phe 
405 

Val Pro Glu Leu 



lie Gly Asp Leu 
440 

Cys Gin Ala Arg 
455 

Asn Val lie Tyr 
470 

Thr lie Val lie 
485 

Arg Gin Arg Lys 



Ala Ala Met Lys 
520 



Ala Thr Pro Pro 
220 

Thr Met Asp Gly 
235 

Leu Ala Glu Glu 
250 

Leu Leu Ala Thr 
265 

Gin Gin Leu Trp 



Gin Glu Asn Val 
300 

Ser Glu Pro Glu 
315 

Ala Pro Ala Gly 
330 

Ala Val Pro Arg 
345 

Arg Arg Ser Phe 



Leu Gin Lys Asn 
380 

Leu Asp Gin Arg 
395 

His Gin Thr Leu 
410 

Lys Cys His Arg 
425 

Arg Pro Val Lys 



Ser Pro Arg Gly 
460 

His Gin Asn Asn 
475 

Leu Gly Thr Val 
490 

lie Gin Lys Tyr 
505 

Leu Asn Thr Pro 



lie Val Glu Val 



Val Phe Pro Ala 
240 

Arg Leu His Ser 
255 

Ala Asn Val Lys 
270 

Cys Glu Val Thr 
285 

Thr Phe Tyr Ser 



Val Ser Glu Trp 
320 

Val Val Val Ser 
335 

Ala Gin Phe Gin 
350 

Ser Cys Ser Ala 
365 

Gin Thr Arg Glu 



Asp Cys Pro Gly 
400 

Lys Cys Gin Ala 
415 

Lys Gly Asp Asp 
430 

Arg Glu Val Ala 
445 

Glu lie Thr Arg 



lie Leu lie lie 
480 

Ser Val Ala Ala 
495 

Lys Leu Gin Lys 
510 

Ala Thr Pro Pro 
525 
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<210> 84 
<211> 535 
<212> PRT 
<213> Bos taurus 

<400> 84 

Met Ala Leu Gly Ala Ala Pro Ala Ala Gin Leu Ala Leu Leu Ala Leu 
15 10 15 

Leu Gly Thr Leu Leu Pro Gly Pro Gly Gly Ala Gly lie Ser lie His 
20 25 30 

Pro Ser Lys Ala lie lie Pro Arg Gly Asp Ser Leu Thr Val Asn Cys 
35 40 45 

Ser Asn Ser Cys Asp Gin Lys Ser Thr Phe Gly Leu Glu Thr Val Leu 
50 55 60 

lie Lys Glu Glu Val Gly Arg Gly Asp Asn Trp Lys Val Phe Gin Leu 
65 70 75 80 

Arg Asp Val Gin Glu Asp lie Glu Leu Phe Cys Tyr Ser Asn Cys His 
85 90 95 

Lys Glu Gin Thr lie Ala Ser Met Asn Leu Thr Val Tyr Trp Phe Pro 
100 105 110 

Glu His Val Glu Leu Ala Pro Leu Pro Leu Trp Gin Pro Val Gly Glu 
115 120 125 

Glu Leu Asn Leu Ser Cys Leu Val Ser Gly Gly Ala Pro Arg Ala His 
130 135 140 

Leu Ser Val Val Leu Leu Arg Gly Glu Glu Glu Leu Gly Arg Gin Pro 
145 150 155 160 

Val Gly Lys Gly Glu Pro Ala Lys Val Met Phe Thr Val Gin Ser Arg 
165 170 175 

Arg Glu Asp His Gly Thr Asn Phe Ser Cys Arg Trp Glu Leu Asp Leu 
180 185 190 

Arg Ser Gin Gly Leu Glu Leu Phe Gin Asn Thr Ser Ala Pro Arg Lys 
195 200 205 

Leu Gin Thr Tyr Val Leu Pro Ser lie Asp Pro His Leu Glu Val Pro 
210 215 220 

Pro lie Val Glu Val Gly Ser Arg Trp Pro Val Asn Cys Thr Leu Asp 
225 230 235 240 

Gly Leu Phe Pro Ala Ser Asp Ala Lys Val Tyr Leu Val Leu Gly Asp 
245 250 255 

Gin Lys Leu Glu Ser Asn lie Thr Tyr Asp Gly Asp Ser Val Leu Ala 
260 265 270 

Lys Ala Trp Met Glu Glu Asn Glu Glu Gly Thr His Ser Leu Lys Cys 
275 280 285 
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Ser Val Thr Leu Gly 
290 

Val Tyr Ser Phe Pro 
305 

Ser Glu Trp Thr Thr 
325 

Val Val Lys Leu Asn 
340 

Gin Leu Lys Leu Asn 
355 

Cys Ser Ala Ala Leu 
370 

Thr Leu Glu Leu His 
385 

Cys Pro Gly Asn Trp 
405 

Cys Glu Ala Gin Gly 
420 

Gly Asp Gly Ala Ser 
435 

Glu Val Ala Gly Thr 
450 

Val Thr Arg Glu Val 
465 

Asp lie Val lie Pro 
485 

Gly Thr Ala Gly Tyr 
500 

Glu Leu Gin Lys Ala 
515 

Asn Ala Gin Ser Thr 
530 



93/119 

Glu Val Ser Arg Arg Thr 
295 

Leu Pro Thr Leu Thr Leu 
310 315 

Val Thr Val Glu Cys Val 
330 

Gly Thr Ser Ala Val Pro 
345 

Ala Ser Ala Ser Asp His 
360 

Glu He Ala Gly Gin Val 
375 

Val Leu Tyr Gly Pro Arg 
390 395 

Thr Trp Gin Glu Gly Ser 
410 

Asn Pro He Pro Lys Leu 
425 

Leu Pro He Gly Asp Leu 
440 

Tyr Leu Cys Arg Ala Thr 
455 

Val Leu Asn Val Leu His 
470 475 

Val Val Ala Val Thr Leu 
490 

Val Tyr Asn Tyr Gin Arg 
505 

Arg Lys Ala Gin Glu Glu 
520 

Pro Pro 
535 



Gin Glu Asn Val Thr 
300 

Ser Pro Pro Glu Val 
320 

Thr Arg Asp Gly Ala 
335 

Pro Gly Pro Arg Ala 
350 

Arg Ser Asn Phe Ser 
365 

Val His Lys His Gin 
380 

Leu Asp Gin Arg Asp 
400 

Glu Gin Thr Leu Lys 
415 

Asn Cys Ser Arg Lys 
430 

Arg Pro Val Arg Arg 
445 

Ser Ala Arg Gly Arg 
460 

Gly Gin Asn lie Leu 
480 

He Leu Gly Ala Leu 
495 

Lys He Gin Lys Tyr 
510 

Ala Ala Leu Lys Leu 
525 



<210> 85 
<211> 530 
<212> PRT 
<213> Ovis aries 

<400> 85 

Met Ala Pro Gly Ala Ala Pro Ala Ala Leu Leu Ala Leu Leu Val Leu 
15 10 15 

Leu Gly Thr Leu Leu Pro Gly Ser Gly Gly Ala Glu He Ser He His 
20 25 30 
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Pro Pro Lys Ala lie lie Pro Arg Gly Gly Ser Leu Arg Val Asn Cys 
35 40 45 

Ser lie Ser Cys Asp Arg Lys Thr Thr Phe Gly Leu Glu Thr Val Leu 
50 55 60 

Asn Lys Glu Glu Val Ser Arg Gly Pro Asn Trp Lys Val Phe Glu Leu 
65 70 75 80 

Ser Asp Val Gin Glu Glu lie Asn Pro Leu Cys Tyr Ser Asn Cys His 
85 90 95 

Gly Glu Gin lie Val Ala Ser Met Asn Leu Thr lie Tyr Trp Phe Pro 
100 105 ^ 110 

Glu Arg Val Glu Leu Ala Pro Leu Pro Leu Trp Gin Pro Val Gly Glu 
115 120 125 

Glu Leu Asn Leu Ser Cys Gin Val Ser Gly Gly Gly Pro Arg His His 
130 135 "* * 140 

Leu Ser Met Val Leu Leu Arg Gly Glu Glu Glu Leu Asp Arg Gin Pro 
145 150 155 " 160 

Val Gly Lys Glu Glu Pro Ala Glu Val Thr Phe Met Val Gin Pro Arg 
165 170 175 

Arg Glu Asp His Gly Thr Ser Phe Ser Cys Arg Trp Glu Leu Asp Leu 
180 185 ~ 190 

Arg Ser Gin Gly Leu Glu Leu Phe Gin Asn Thr Ser Ala Pro Arg Lys 
195 200 205 

Leu Gin Thr Tyr Val Leu Pro Ser Thr Asp Pro His Leu Glu Ala Pro 
210 215 220 

Pro Val Val Glu Val Gly Ser Arg Trp Pro Val Lys Cys Thr Leu Asp 
225 230 235 240 

Gly Leu Phe Pro Ala Ser Asp Ala Glu Val Tyr Val Gin Leu Gly Asp 
245 250 255 

Gin Lys Leu Glu Ser Asn lie Thr Tyr Asn Gly Asp Ser Val Leu Ala 
260 265 270 

Glu Ala Trp Thr Glu Glu Asn Glu Glu Gly Thr His Ser Leu Arg Cys 
275 280 285 

Ser Val Ser Leu Gly Glu Lys lie Arg Arg Thr Arg Gly Ser Val Thr 
290 295 300 

Met Tyr Ser Phe Pro Leu Pro Thr Leu Thr Leu Ser Pro Pro Glu Val 
305 310 315 320 

Ser Glu Trp Thr Thr Val Thr Val Glu Cys Val Thr Arg Asp Gly Ala 
325 330 ^ 335 

Val Val Arg Leu Asn Gly Val Ser Ala Glu Pro Pro Gly Pro Arg Ala 
340 345 350 

Gin Leu Lys Leu Asn Val Ser Ala Asp Asp His Gly Ser Asn Phe Ser 
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Cys Ser Ala Ala 
370 

Thr Arg Glu Leu 
385 

Cys Leu Gly Asn 



Cys Ala Ala Arg 
420 

Gly Asp Gly Ala 
435 

Glu Val Ala Gly 
450 

Val Thr Arg Glu 
465 

Asp lie Val lie 



Gly Thr Ala Gly 
500 

Glu Leu Gin Lys 
515 



Leu Lys lie Ala 
375 

His Val Leu Tyr 
390 

Trp Thr Trp Gin 
405 

Gly Asn Pro lie 



Ser Leu Pro lie 
440 

Thr Tyr Leu Cys 
455 

Val Val Leu Asn 
470 

Pro Val Val Ala 
485 

Tyr lie Tyr Asn 



Ala Gin Lys Glu 
520 



Gly Gin Glu Val 
380 

Gly Pro Arg Leu 
395 

Glu Gly Ser Glu 
410 

Pro Lys Leu Asn 
425 

Gly Asp Leu Arg 



Trp Ala Thr Ser 
460 

Val Leu Tyr Gly 
475 

Val Thr Leu lie 
490 

Tyr Gin Arg Lys 
505 

Ala Ala Leu Lys 



His Lys lie Gin 



Asp Gin Arg Asp 
400 

Gin Thr Leu Lys 
415 

Cys Ser Arg Lys 
430 

Pro Val Thr Arg 
445 

Ala Arg Gly Gly 



Gin Asn lie Leu 
480 

Leu Gly Thr Leu 
495 

lie Gin Lys Tyr 
510 

Leu Lys Ser Thr 
525 



Pro Pro 
530 



<210> 86 
<211> 545 
<212> PRT 

<213> Rattus norvegicus 
<400> 86 

Met Ala Ser Thr Arg Ala Arg Pro Met Leu Pro Leu Leu Leu Val Leu 
15 10 15 

Val Ala Val Val lie Pro Gly Pro Val Gly Ala Gin Val Ser lie His 
20 25 30 

Pro Thr Glu Ala Phe Leu Pro Arg Gly Gly Ser Val Gin Val Asn Cys 
35 40 45 

Ser Ser Ser Cys Glu Asp Glu Asn Leu Gly Leu Gly Leu Glu Thr Asn 
50 55 60 

Trp Met Lys Asp Glu Leu Ser Ser Gly His Asn Trp Lys Leu Phe Lys 
65 70 75 80 

Leu Ser Asp lie Gly Glu Asp Ser Arg Pro Leu Cys Phe Glu Asn Cys 
85 90 95 

Gly Thr Thr Gin Ser Ser Ala Ser Ala Thr lie Thr Val Tyr Ser Phe 
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Pro Glu Arg Val 
115 

Lys Asn Leu lie 
130 

Gin Leu Ser Val 
145 

Ala Val Asp Gly 



Arg Gly Asp His 
180 

Arg Pro Gin Gly 
195 

Leu Arg Thr Phe 
210 

Asp Leu Leu Glu 
225 

Gly Leu Phe Pro 



Gin Met Leu Thr 
260 

Thr Ala Ser Val 
275 

Arg Cys Val Leu 
290 

Leu Arg lie Tyr 
305 

Glu Val Ser Glu 



Gly Ala Gin Val 
340 

Ser Gin Gly Thr 
355 

Leu Asn Ala Ser 
370 

Ala Leu Glu Val 
385 

Leu His Val Leu 



Asn Trp Thr Trp 
420 



Glu Leu Asp Pro 
120 

Leu Arg Cys Leu 
135 

Val Leu Leu Arg 
15 0 

Asp Pro Lys Glu 
165 

Gly Ala Asn Phe 



Leu Ser Leu Phe 
200 

Asp Leu Pro Thr 
215 

Val Gly Thr Gin 
230 

Ala Ser Glu Ala 
245 

Leu Glu Ser Thr 



Glu Val Thr Glu 
280 

Glu Leu Ala Asp 
295 

Asn Phe Ser Ala 
310 

Gly Asp Gin Val 
325 

Val Leu Leu Asn 



Ser Pro Arg Pro 
360 

Pro Glu Asp His 
375 

Asp Gly Lys Ser 
390 

Tyr Gly Pro His 
405 

Gin Glu Gly Ser 



Leu Pro Ala Trp 



Val Glu Gly Gly 
140 

Gly Asn Glu Thr 
155 

lie Thr Phe Thr 
170 

Ser Cys Phe Thr 
185 

Lys Asn Val Ser 



Arg Val Leu Lys 
220 

Gin Lys Phe Leu 
235 

Gin lie Tyr Leu 
250 

Asn Ser Arg Asp 
265 

Lys Leu Asp Arg 



Gin Thr Leu Glu 
300 

Pro lie Leu Thr 
315 

Thr Val Lys Cys 
330 

Ser Thr Ser Pro 
345 

Pro Thr Ser Gin 



Lys Arg Arg Phe 
380 

Leu Phe Lys Asn 
395 

Leu Asp Lys Lys 
410 

Gin Gin Thr Leu 
425 



Gin Gin Val Gly 
125 

Ala Pro Arg Thr 



Leu Ser Arg Gin 
160 

Val Leu Ala Ser 
175 

Glu Leu Asp Leu 
190 

Glu Val Arg Gin 
205 

Leu Asp Thr Pro 



Cys Ser Leu Glu 
240 

Glu Met Gly Gly 
255 

Phe Val Ser Ala 
270 

Thr Leu Gin Leu 
285 

Met Glu Lys Thr 



Leu Ser Gin Pro 
320 

Glu Ala His Gly 
335 

Arg Pro Pro Thr 
350 

lie Gin Phe Thr 
365 

Phe Cys Ser Ala 



Gin Thr Leu Glu 
400 

Asp Cys Leu Gly 
415 

Thr Cys Gin Pro 
430 
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Gin Gly Asn Pro 
435 

Val Pro Leu Pro 
450 

Gly Thr Tyr Lys 
465 

Asp Val His Leu 



He Val Gly Val 
500 

Ser He Tyr Thr 
515 

Gin Lys Ala Gin 
530 



Ala Pro Asn Leu 
440 

He Gly Met Val 
455 

Cys Arg Ala Phe 
470 

Thr Val Leu Tyr 
485 

Leu Val Leu He 



Tyr Tyr Arg Gin 
520 

Glu Glu Ala Leu 
535 



Thr Cys Ser Arg 



Lys Ser Val Lys 
460 

Ser Ser Arg Gly 
475 

His Asp Gin Asn 
490 

He Ala Gly Phe 
505 

Arg Lys He Arg 



Lys Leu Lys Val 
540 



Lys Ala Asp Gly 
445 

Arg Glu Met Asn 



Ser He Thr Arg 
480 

Thr Trp Val He 
495 

Val He Val Ala 
510 

He Tyr Lys Leu 
525 

Gin Ala Pro Pro 



Pro 
545 



<210> 87 
<211> 917 
<212> PRT 

<213> Rattus norvegicus 
<400> 87 

Met Pro Gly Pro Ser Pro Gly Leu Arg Arg Thr Leu Leu Gly Leu Trp 
15 10 15 

Ala Ala Leu Gly Leu Gly He Leu Gly He Ser Ala Val Ala Leu Glu 
20 25 30 

Pro Phe Trp Ala Asp Leu Gin Pro Arg Val Ala Leu Val Glu Arg Gly 
35 40 45 

Gly Ser Leu Trp Leu Asn Cys Ser Thr Asn Cys Pro Arg Pro Glu Arg 
50 55 60 

Gly Gly Leu Glu Thr Ser Leu Arg Arg Asn Gly Thr Gin Arg Gly Leu 
65 70 75 80 

Arg Trp Leu Ala Arg Gin Leu Val Asp He Arg Glu Pro Glu Thr Gin 
85 90 95 

Pro Val Cys Phe Phe Arg Cys Ala Arg Arg Thr Leu Gin Ala Arg Gly 
100 105 110 

Leu He Arg Thr Phe Gin Arg Pro Asp Arg Val Glu Leu Val Pro Leu 
115 120 125 

Pro Pro Trp Gin Pro Val Gly Glu Asn Phe Thr Leu Ser Cys Arg Val 
130 135 140 



Pro Gly Ala Gly Pro Arg Ala Ser Leu Thr Leu Thr Leu Leu Arg Gly 
145 * * 150 155 160 



WO 03/064992 



98/119 



PCT/US02/34197 



Gly Gin Glu Leu 



Arg Gly Ala Met 
180 

Arg Ala Asn Phe 
195 

Leu Gly Leu Phe 
210 

Ala Met Pro Pro 
225 

Val Gly Ser Glu 



Ala Pro Glu Ala 
260 

Pro Asn Val Thr 
275 

Thr Ala Ser Glu 
290 

Thr Leu Gly Gly 
305 

Ser Phe Pro Ala 



Gly Lys Met Val 
340 

Thr Leu Glu Gly 
355 

Gin Leu Asn Val 
370 

Ala Ala Leu Asp 
385 

Glu Leu Arg Val 



Arg Ser Trp Thr 
420 

Ala Arg Gly Asn 
435 

Gly Ala Val Leu 
450 

Ala Gly Thr Tyr 
465 



lie Arg Arg Ser 
165 

Leu Thr Ala Thr 



Ser Cys Leu Ala 
200 

Ala Asn Ser Ser 
215 

Leu Ser Pro Ser 
230 

Arg Pro Val Thr 
245 

Gly Val Tyr Leu 



Leu Asp Gly Glu 
280 

Glu Gin Glu Gly 
295 

Glu Ser Arg Glu 
310 

Pro Leu Leu Thr 
325 

Thr Val Ser Cys 



lie Pro Ala Ala 
360 

Thr Lys Asn Asp 
375 

Val Asp Gly Glu 
390 

Leu Tyr Ala Pro 
405 

Trp Pro Glu Gly 



Pro Glu Pro Ser 
440 

Ala Leu Gly Leu 
455 

Arg Cys Thr Ala 
470 



Phe Val Gly Glu 
170 

Val Leu Ala Arg 
185 

Glu Leu Asp Leu 



Ala Pro Arg Gin 
220 

Leu lie Ala Pro 
235 

Cys Thr Leu Asp 
250 

Ser Leu Gly Asp 
265 

Ser Leu Val Ala 



Thr Lys Gin Leu 
300 

Thr Gin Glu Asn 
315 

Leu Ser Glu Pro 
330 

Trp Ala Gly Ala 
345 

Val Pro Gly Gin 



Asp Lys Arg Gly 
380 

Thr Leu Arg Lys 
395 

Arg Leu Asp Asp 
410 

Pro Glu Gin Thr 
425 

Val His Cys Ala 



Leu Gly Pro Val 
460 

lie Asn Gly Gin 
475 



Pro Pro Arg Ala 
175 

Arg Glu Asp His 
190 

Arg Pro His Gly 
205 

Leu Arg Thr Phe 



Arg Phe Leu Glu 
240 

Gly Leu Phe Pro 
255 

Gin Arg Leu His 
270 

Thr Ala Thr Ala 
285 

Met Cys lie Val 



Leu Thr Val Tyr 
320 

Glu Ala Pro Glu 
335 

Arg Ala Leu Val 
350 

Pro Ala Glu Leu 
365 

Phe Phe Cys Asp 



Asn Gin Ser Ser 
400 

Leu Asp Cys Pro 
415 

Leu His Cys Glu 
430 

Arg Pro Asp Gly 
445 

Thr Arg Ala Leu 



Gly Gin Ala Val 
480 



WO 03/064992 



PCT/US02/34197 



Lys Asp Val Thr 



Gly Cys Pro Glu 
500 

Ser Cys Val Ala 
515 

Ser Gly Lys Glu 
530 

His Ala Gly Thr 
545 

Ala Lys Asn Val 



Leu Gly Cys Pro 
580 

Phe Ser Cys Glu 
595 

Gly Ser Glu Gly 
610 

Asn Ser Gly Ser 
625 

lie Tyr Leu Cys 



Val Val Val Ser 
660 

Pro Ser His Gin 
675 

Cys Ser Ala Arg 
690 

Gly Ala Ala Arg 
705 

Thr Tyr Leu Cys 



Val Thr Val Gly 
740 

Ser Pro Pro Ser 
755 

Ala Glu Ala Trp 
770 

Ala Leu Asn Leu 
785 



Leu Thr Val Glu 
485 

Arg lie Thr Trp 



His Gly Val Pro 
520 

Glu Val Met Glu 
535 

Tyr Arg Cys Glu 
550 

Ala Val Thr Val 
565 

Ser Asn Trp Thr 



Val Asp Gly Lys 
600 

Ala Ser Glu Gly 
615 

Arg Asn Ser Met 
630 

Asn Ala Thr Asn 
645 

Ala Glu Ser Pro 



Thr Trp Leu Glu 
680 

Gly Arg Pro Ser 
695 

Leu Glu Arg Leu 
710 

Val Ala Thr Asn 
725 

Val Glu Tyr Arg 



Val Arg Pro Gly 
760 

Pro Pro Ala Gin 
775 

Gly Leu Ser Ser 
790 
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Tyr Ala Pro Ala 
490 

Leu Glu Gly Thr 
505 

Pro Pro Ser Val 



Gly Pro Leu Arg 
540 

Ala lie Asn Ala 
555 

Glu Tyr Gly Pro 
570 

Trp Val Glu Gly 
585 

Pro Glu Pro Arg 



Val Val Leu Pro 
620 

Thr Pro Gly Asn 
635 

Arg His Gly Ser 
650 

Pro Gin Met Asp 
665 

Gly Ala Glu Ala 



Pro Arg Val Arg 
700 

Gin Val Ser Arg 
715 

Ala His Gly Thr 
730 

Pro Val Val Ala 
745 

Gly Asn Phe Thr 



lie Ser Trp Arg 
780 

Asn Asn Ser Thr 
795 



Leu Asp Ser Val 
495 

Glu Ala Ser Leu 
510 

Ser Cys Val Arg 
525 

Val Ala Arg Glu 



Arg Gly Ser Ala 
560 

Ser Phe Glu Glu 
575 

Ser Gly Lys Leu 
590 

Val Glu Cys Val 
605 

Leu Val Ser Ser 



Leu Ser Pro Gly 
640 

Thr Val Lys Thr 
655 

Glu Ser Ser Cys 
670 

Thr Ala Leu Ala 
685 

Cys Ser Arg Glu 



Glu Asp Ala Gly 
720 

Asp Ser Arg Thr 
735 

Glu Leu Ala Ala 
750 

Leu Thr Cys Arg 
765 

Ala Pro Pro Gly 



Leu Ser Val Ala 
800 



Gly Ala Met Gly Ser His Gly Gly Glu Tyr Glu Cys Ala Ala Thr Asn 
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815 



Ala His Gly Arg 
820 

Trp Leu Trp Val 
835 

Ala Ala Gly Ala 
850 

Lys Gly Glu Tyr 
865 

Cys Leu Asn Gly 



Thr Pro Gly Thr 
900 

Gin Leu Thr Ser 
915 



His Ala Arg Arg 



Ala Val Gly Gly 
840 

Gly Leu Ala Phe 
855 

Asn Val Gin Glu 
870 

Ala Gly Gly Thr 
885 

Ala Glu Ser Pro 



Ser 



lie Thr Val Arg 
825 

Ala Ala Gly Gly 



Tyr Val Gin Ser 
860 

Ala Glu Ser Ser 
875 

Pro Gly Ala Glu 
890 

Ala Asp Gly Glu 
905 



Val Ala Gly Pro 
830 

Ala Ala Leu Leu 
845 

Thr Ala Cys Lys 



Gly Glu Ala Val 
880 

Gly Gly Ala Glu 
895 

Val Phe Ala lie 
910 



<210> 88 
<211> 151 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> MOD_RES 
<222> (12) 

<223> Any amino acid 
<220> 

<221> MOD_RES 
<222> (77) . . (81) 
<223> Any amino acid 

<220> 

<221> MOD_RES 

<222> (132) 

<223> Any amino acid 

<220> 

<221> MOD_RES 
<222> (145) . . (147) 
<223> Any amino acid 

<400> 88 

Glu Asp Ser Gin Pro Met Cys Tyr Ser Asn Cys Xaa Asp Gly Gin Ser 
15 10 15 

Thr Ala Lys Thr Phe Leu Thr Val Tyr Trp Thr Pro Glu Arg Val Glu 
20 25 30 

Leu Ala Pro Leu Pro Ser Trp Gin Pro Val Gly Lys Asn Leu Thr Leu 
35 40 45 



Arg Cys Gin Val Glu Gly Gly Ala Pro Arg Ala Asn Leu Thr Val Val 
50 55 60 
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Leu Leu Arg Gly Glu Lys 
65 70 

Xaa Ala Glu Val Thr Thr 
85 

Ala Asn Phe Ser Cys Arg 
100 

Glu Leu Phe Glu Asn Thr 
115 

Leu Pro Ala Xaa Pro Pro 
130 

Xaa Xaa Xaa Gly Thr Val 
145 150 



Glu Leu Lys Arg Glu 
75 

Thr Val Leu Val Arg 
90 

Thr Glu Leu Asp Leu 
105 

Ser Ala Pro Tyr Gin 
120 

Gin Leu Val Ser Pro 
135 

Val 



Pro Xaa Xaa Xaa Xaa 
80 

Arg Asp His His Gly 
95 

Arg Pro Gin Gly Leu 
110 

Leu Gin Thr Phe Val 
125 

Arg Val Leu Glu Val 
140 



<210> 89 
<211> 1252 
<212> PRT 

<213> Rattus norvegicus 
<400> 89 

Met Gly Ala Lys Arg Val Thr Val Arg Gly Ala Arg Thr Ser Pro lie 
15 10 15 

His Arg Met Ser Ser Leu Thr Pro Leu Leu Leu Met Gly Met Leu Thr 
20 25 30 

Ser Gly Leu Ala Glu Ser Pro Val Pro Thr Ser Ala Pro Arg Gly Phe 
35 40 45 

Trp Ala Leu Ser Glu Asn Leu Thr Ala Val Glu Gly Thr Thr Val Lys 
50 55 60 

Leu Trp Cys Gly Val Arg Ala Pro Gly Ser Val Val Gin Trp Ala Lys 
65 70 75 80 

Asp Gly Leu Leu Leu Gly Pro Asn Pro Lys Met Pro Gly Phe Pro Arg 
85 90 95 

Tyr Ser Leu Glu Gly Asp Arg Ala Lys Gly Glu Phe His Leu Leu lie 
100 105 110 

Glu Ala Cys Asp Leu Ser Asp Asp Ala Glu Tyr Glu Cys Gin Val Gly 
115 120 125 

Arg Ser Glu Leu Gly Pro Glu Leu Val Ser Pro Lys Val lie Leu Ser 
130 135 140 

lie Leu Val Ser Pro Lys Val Leu Leu Leu Thr Pro Glu Ala Gly Ser 
145 150 155 160 

Thr Val Thr Trp Val Ala Gly Gin Glu Tyr Val Val Thr Cys Val Ser 
165 170 175 



Gly Asp Ala Lys Pro Ala Pro Asp He Thr Phe He Gin Ser Gly Arg 
180 185 190 
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Thr lie Leu Asp 
195 

Leu Cys lie Thr 
210 

Asn Gly Gin Leu 
225 

Pro lie Lys Ala 



Pro Val lie Asp 
260 

Glu Asn Leu Glu 
275 

Thr Leu Gin Trp 
290 

Thr Glu His Ala 
305 

Arg Pro Glu Asp 



Val Ser Ala Gly 
340 

Pro Pro Ser Ala 
355 

Lys Asn Val Thr 
370 

Leu Leu Arg Trp 
385 

Thr Val Met Asp 



Thr Phe Leu Val 
420 

Ala Phe Ser Asp 
435 

Leu Asn Val Lys 
450 

Glu Gly Gin Tyr 
465 

Ala He Gly Gly 



Arg Pro Val Ser 
500 



Val Ser Ser Asn 
200 

Glu Ala Glu Ala 
215 

Leu Val Cys Glu 
230 

Ser Phe Thr Met 
245 

Trp Pro Gly Leu 



Leu Pro Cys Thr 
280 

Leu Lys Asn Gly 
295 

Gin Ala Val Ala 
310 

His Gly Ala Arg 
325 

Thr Gin Glu Arg 



He Thr He Leu 
360 

Leu Cys Cys Leu 
375 

Trp Leu Gly Gly 
390 

Gly Leu His Gly 
405 

Arg Arg Glu Asp 



Ala Phe Ser Lys 
440 

Tyr Pro Ala Gin 
455 

He Arg Thr Gly 
470 

Asn Pro Asp Pro 
485 

Glu Pro Arg Gin 



Val Asn Glu Gly 



Arg Val He Pro 
220 

Gly Ser Asn Pro 
235 

Asn He Leu Phe 
250 

Asn Glu Gly His 
265 

Ala Arg Gly Gly 



Lys Pro Val Ser 
300 

His Ser Val Leu 
315 

Leu Ser Cys Gin 
330 

Ser He Thr Leu 
345 

Gly Ser Val Ser 



Thr Lys Ser Ser 
380 

Arg Gin Leu Leu 
395 

Gly His He Ser 
410 

Asn Gly Leu Pro 
425 

Glu Thr Phe Lys 



Lys Leu Trp He 
460 

Thr Arg Val Arg 
475 

Ser Leu He Trp 
490 

Pro Gin Glu Pro 
505 



Ser Glu Glu Lys 
205 

Gin Ser Ser Asp 



Ala Leu Asp Thr 
240 

Pro Pro Gly Pro 
255 

Val Arg Ala Gly 
270 

Asn Pro Pro Ala 
285 

Thr Ala Trp Gly 



Val Met Thr Val 
320 

Ser Tyr Asn Ser 
335 

Gin Val Thr Phe 
350 

Gin Ser Glu Asn 
365 

Arg Pro Arg Val 



Pro Thr Asp Glu 
400 

Met Ser Asn Leu 
415 

Leu Thr Cys Glu 
430 

Lys Ser Leu Thr 
445 

Glu Gly Pro Pro 



Leu Val Cys Leu 
480 

Phe Lys Asp Ser 
495 

Arg Arg Val Gin 
510 
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Leu Gly Ser Val Glu Lys Ser Gly Ser Thr Phe Ser Arg Glu Leu Val 
515 520 525 

Leu lie lie Gly Pro Pro Asp Asn Arg Ala Lys Phe Ser Cys Lys Ala 
530 535 540 

Gly Gin Leu Ser Ala Ser Thr Gin Leu Val Val Gin Phe Pro Pro Thr 
545 550 555 560 

Asn Leu Thr lie Leu Ala Asn Ser Ser Ala Leu Arg Pro Gly Asp Ala 
565 570 575 

Leu Asn Leu Thr Cys Val Ser lie Ser Ser Asn Pro Pro Val Asn Leu 
580 585 590 

Ser Trp Asp Lys Glu Gly Glu Arg Leu Glu Asp Val Ala Ala Lys Pro 
595 600 605 

Gin Ser Ala Pro Phe Lys Gly Ser Ala Ala Ser Arg Ser Val Phe Leu 
610 615 620 

Arg Val Ser Ser Arg Asp His Gly Gin Arg Val Thr Cys Arg Ala His 
625 630 635 640 

Ser Glu Ala Leu Arg Glu Thr Val Ser Ser Phe Tyr Arg Phe Asn Val 
645 650 655 

Leu Tyr Pro Pro Glu Phe Leu Gly Glu Gin Val Arg Ala Val Thr Val 
660 665 670 

Val Glu Gin Gly Gin Val Leu Leu Pro Val Ser Val Ser Ala Asn Pro 
675 680 685 

Ala Pro Glu Ala Phe Asn Trp Thr Phe Arg Gly Tyr Arg Leu Ser Pro 
690 695 700 

Ala Gly Gly Pro Arg His Arg lie Leu Ser Gly Gly Ala Leu Gin Leu 
705 710 715 720 

Trp Asn Val Thr Arg Ala Asp Asp Gly Phe Tyr Gin Leu His Cys Gin 
725 730 735 

Asn Ser Glu Gly Thr Ala Glu Ala Leu Leu Lys Leu Asp Val His Tyr 
740 745 750 

Ala Pro Thr lie Arg Ala Leu Arg Asp Pro Thr Glu Val Asn Val Gly 
755 760 765 

Gly Ser Val Asp lie Val Cys Thr Val Asp Ala Asn Pro lie Leu Pro 
770 775 780 

Glu Met Phe Ser Trp Glu Arg Leu Gly Glu Glu Glu Glu Asp Leu Asn 
785 790 795 800 

Leu Asp Asp Met Glu Lys Val Ser Lys Gly Ser Thr Gly Arg Leu Arg 
805 810 815 

lie Arg Gin Ala Lys Leu Ser Gin Ala Gly Ala Tyr Gin Cys He Val 
820 825 830 

Asp Asn Gly Val Ala Pro Ala Ala Arg Gly Leu Val Arg Leu Val Val 
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835 840 845 

Arg Phe Ala Pro Gin Val Asp Gin Pro Thr Pro Leu Thr Lys Val Ala 
850 855 860 

Ala Ala Gly Asp Ser Thr Ser Ser Ala Thr Leu His Cys Arg Ala Arg 
865 870 875 880 

Gly Val Pro Asn lie Asp Phe Thr Trp Thr Lys Asn Gly Val Pro Leu 
885 890 895 

Asp Leu Gin Asp Pro Arg Tyr Thr Glu His Arg Tyr His Gin Gly Val 
900 905 910 

Val His Ser Ser Leu Leu Thr lie Ala Asn Val Ser Ala Ala Gin Asp 
915 920 925 

Tyr Ala Leu Phe Lys Cys Thr Ala Thr Asn Ala Leu Gly Ser Asp His 
930 935 940 

Thr Asn lie Gin Leu Val Ser lie Ser Arg Pro Asp Pro Pro Leu Gly 
945 950 955 960 

Leu Lys Val Val Ser lie Ser Pro His Ser Val Gly Leu Glu Trp Lys 
965 970 975 

Pro Gly Phe Asp Gly Gly Leu Pro Gin Arg Phe Gin lie Arg Tyr Glu 
980 985 990 

Ala Leu Glu Thr Pro Gly Phe Leu His Val Asp Val Leu Pro Thr Gin 
995 1000 1005 

Ala Thr Thr Phe Thr Leu Thr Gly Leu Lys Pro Ser Thr Arg Tyr Arg 
1010 1015 1020 

lie Trp Leu Leu Ala Ser Asn Ala Leu Gly Asp Ser Gly Leu Thr Asp 
1025 1030 1035 1040 

Lys Gly lie Gin Val Ser Val Thr Thr Pro Gly Pro Asp Gin Ala Pro 
1045 1050 1055 

Glu Asp Thr Asp His Gin Leu Pro Thr Glu Leu Pro Pro Gly Pro Pro 
1060 1065 1070 

Arg Leu Pro Leu Leu Pro Val Leu Phe Ala Val Gly Gly Leu Leu Leu 
1075 1080 1085 

Leu Ser Asn Ala Ser Cys Val Gly Gly Leu Leu Trp Arg Arg Arg Leu 
1090 1095 1100 

Arg Arg Leu Ala Glu Glu lie Ser Glu Lys Thr Glu Ala Gly Ser Glu 
1105 1110 1115 1120 

Asp Arg lie Arg Asn Glu Tyr Glu Glu Ser Gin Trp Thr Gly Asp Arg 
1125 1130 1135 

Asp Thr Arg Ser Ser Thr Val Ser Thr Ala Glu Val Asp Pro Asn Tyr 
1140 1145 1150 



Tyr Ser Met Arg Asp Phe Ser Pro Gin Leu Pro Pro Thr Leu Glu Glu 
1155 1160 1165 
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Val Leu Tyr His Gin Gly Ala Glu Gly Glu Asp Met Ala Phe Pro Gly 
1170 1175 1180 

His Leu His Asp Glu Val Glu Arg Ala Tyr Gly Pro Pro Gly Ala Trp 
1185 1190 1195 1200 

Gly Pro Leu Tyr Asp Glu Val Arg Met Asp Pro Tyr Asp Leu Arg Trp 
1205 1210 1215 

Pro Glu Val Gin Cys Glu Asp Pro Arg Gly lie Tyr Asp Gin Val Ala 
1220 1225 1230 

Ala Asp Met Asp Ala Val Glu Ala Ser Ser Leu Pro Phe Glu Leu Arg 
1235 1240 1245 

Gly His Leu Val 
1250 



<210> 90 
<211> 1256 
<212> PRT 

<213> Mus musculus 
<400> 90 

Met Gly Ala Lys Glu Val Thr Val Arg Gly Pro Gly Ala Ser Pro Val 
15 10 15 

His Arg Thr Cys Arg Leu lie Pro Leu Leu Leu Ala Gly Met Leu Thr 
20 25 30 

Thr Gly Leu Ala Gin Ser Pro Val Pro Thr Ser Ala Pro Arg Gly Phe 
35 40 45 

Trp Ala Leu Ser Glu Asn Leu Thr Val Val Glu Gly Ser Thr lie Lys 
50 55 60 

Leu Trp Cys Gly Val Arg Ala Pro Gly Ser Val Val Gin Trp Ala Lys 
65 70 75 80 

Asp Gly Leu Leu Leu Gly Pro Asn Pro Lys lie Pro Gly Phe Pro Arg 
85 90 95 

Tyr Ser Leu Glu Gly Asp Ser Ala Lys Gly Glu Phe His Leu Leu lie 
100 105 110 

Glu Ala Cys Asp Leu Ser Asp Asp Ala Glu Tyr Glu Cys Gin Val Gly 
115 120 125 

Arg Ser Glu Leu Gly Pro Glu Leu Val Ser Pro Arg Val lie Leu Ser 
130 135 140 

Val Leu Val Pro Pro Lys Val Leu Gin Leu Thr Pro Glu Ala Gly Ser 
145 150 155 160 

Thr Val Thr Trp Val Ala Gly Gin Glu Tyr Val Val Thr Cys Val Ser 
165 170 175 



Gly Gly Ala Lys Pro Ala Pro Asp lie lie Phe lie Gin Gly Gly Arg 
180 185 190 
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Thr Val Glu Asp 
195 

Leu Phe Phe Thr 
210 

Asn Gly Gin Leu 
225 

Pro lie Lys Ala 



Pro Val lie Asp 
260 

Glu Asn Leu Glu 
275 

Thr Leu Gin Trp 
290 

Thr Glu His Ala 
305 

Arg Pro Glu Asp 



Val Ser Ala Glu 
340 

Pro Pro Ser Ala 
355 

Lys Asn Val Thr 
370 

Leu Leu Arg Trp 
385 

Thr Val Met Asp 



Thr Leu Leu Val 
420 

Ala Phe Ser Asp 
435 

Leu Asn Val Lys 
450 

Glu Gly Gin Ser 
465 

Ala lie Gly Gly 



Arg Pro Val Asn 
500 



Val Ser Ser Ser 
200 

Glu Ala Glu Ala 
215 

Leu Val Cys Glu 
230 

Ser Phe Thr Met 
245 

Trp Pro Gly Leu 



Leu Pro Cys lie 
280 

Leu Lys Asn Gly 
295 

Gin Ala Val Ala 
310 

His Gly Ala Arg 
325 

Thr Gin Glu Arg 



Val Thr lie Leu 
360 

Leu Cys Cys Leu 
375 

Trp Leu Gly Gly 
390 

Gly Leu His Gly 
405 

Lys Arg Glu Asp 



Ala Phe Ser Lys 
440 

Tyr Pro Ala Gin 
455 

lie Arg Thr Gly 
470 

Asn Pro Glu Pro 
485 

Asp Pro Arg Gin 



Val Asn Glu Gly 



Arg Val Thr Pro 
220 

Gly Ser Asn Pro 
235 

Asn lie Leu Phe 
250 

Asn Glu Gly His 
265 

Ala Arg Gly Gly 



Lys Pro Val Ser 
300 

His Ser Val Leu 
315 

Leu Ser Cys Gin 
330 

Ser lie Thr Leu 
345 

Gly Ser Thr Ser 



Thr Lys Ser Ser 
380 

Arg Gin Leu Leu 
395 

Gly His lie Ser 
410 

Asn Gly Leu Ser 
425 

Glu Thr Phe Lys 



Lys Leu Trp lie 
460 

Thr Arg Val Arg 
475 

Ser Leu Thr Trp 
490 

Ser Gin Glu Pro 
505 



Ser Glu Glu Lys 
205 

Gin Ser Ser Asp 



Ala Leu Ala Thr 
240 

Pro Pro Gly Pro 
255 

Val Arg Ala Gly 
270 

Asn Pro Pro Ala 
285 

lie Ala Trp Gly 



Val Met Thr Val 
320 

Ser Tyr Asn Ser 
335 

Gin Val Thr Phe 
350 

Gin Ser Glu Asn 
365 

Arg Pro Arg Val 



Pro Thr Asp Glu 
400 

Met Ser Asn Leu 
415 

Leu Thr Cys Glu 
430 

Lys Ser Leu Thr 
445 

Glu Gly Pro Pro 



Leu Val Cys Leu 
480 

Leu Lys Asp Ser 
4 95 

Arg Arg Val Gin 
510 
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Leu Gly Ser Val 
515 

Leu He lie Gly 
530 

Gly Gin Leu Ser 
545 

Asn Leu Thr He 



Leu Asn Leu Thr 
580 

Ser Leu Asp Lys 
595 

Gin Ser Ala Pro 
610 

Arg Val Ser Ser 
625 

Ser Glu Ala Leu 



Leu Tyr Pro Pro 
660 

Val Glu Gin Gly 
675 

Ala Pro Glu Ala 
690 

Ala Gly Gly Pro 
705 

Trp Asn Val Thr 



Asn Ser Glu Gly 
740 

Ala Pro Thr He 
755 

Gly Ser Val Asp 
770 

Glu Met Phe Ser 
785 

Leu Asp Asp Met 



He Arg Gin Ala 
820 



Glu Lys Ser Gly 
520 

Pro Pro Asp Asn 
535 

Ala Ser Thr Gin 
550 

Leu Ala Asn Ser 
565 

Cys Val Ser He 



Glu Gly Glu Arg 
600 

Phe Lys Gly Ser 
615 

Arg Asp His Gly 
630 

Arg Glu Thr Val 
645 

Glu Phe Leu Gly 



Gin Ala Leu Leu 
680 

Phe Asn Trp Thr 
695 

Arg His Arg He 
710 

Arg Ala Asp Asp 
725 

Thr Ala Glu Ala 



Arg Ala Leu Lys 
760 

He Val Cys Thr 
775 

Trp Glu Arg Leu 
790 

Glu Lys Met Ser 
805 

Lys Leu Ser Gin 
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Ser Thr Phe Ser 



Leu Ala Lys Phe 
540 

Leu Val Val Gin 
555 

Ser Ala Leu Arg 
570 

Ser Ser Asn Pro 
585 

Leu Asp Asp Val 



Ala Ala Ser Arg 
620 

His Arg Val Thr 
635 

Ser Ser Phe Tyr 
650 

Glu Gin Val Arg 
665 

Pro Val Ser Val 



Phe Arg Gly Tyr 
700 

Leu Ser Gly Gly 
715 

Gly Phe Tyr Gin 
730 

Leu Leu Lys Leu 
745 

Asp Pro Thr Glu 



Val Asp Ala Asn 
780 

Gly Glu Asp Glu 
795 

Lys Gly Ser Thr 
810 

Ala Gly Ala Tyr 
825 



Arg Glu Leu Val 
525 

Ser Cys Lys Ala 



Phe Pro Pro Thr 
560 

Pro Gly Asp Ala 
575 

Pro Val Asn Leu 
590 

Ala Ala Lys Pro 
605 

Ser Val Phe Leu 



Cys Arg Ala His 
640 

Arg Leu Asn Val 
655 

Ala Val Thr Val 
670 

Ser Ala Asn Pro 
685 

Arg Leu Ser Pro 



Ala Leu Gin Leu 
720 

Leu His Cys Gin 
735 

Asp Val His Tyr 
750 

Val Asn Val Gly 
765 

Pro He Leu Pro 



Glu Glu Leu Asn 
800 

Gly Arg Leu Arg 
815 

Gin Cys He Val 
830 



Asp Asn Gly Val Ala Pro Ala Ala Arg Gly Leu Val Arg Leu Val Val 
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835 840 845 

Arg Phe Ala Pro Gin Val Asp His Pro Thr Pro Leu Thr Lys Val Ala 
850 855 860 

Ala Ala Gly Asp Ser Thr Ser Ser Ala Thr Leu His Cys Arg Ala Arg 
865 870 875 880 

Gly Val Pro Asn lie Asp Phe Thr Trp Thr Lys Asn Gly Val Pro Leu 
885 890 895 

Asp Leu Gin Asp Pro Arg Tyr Thr Glu His Lys Tyr His Gin Gly Val 
900 905 910 

Val His Ser Ser Leu Leu Thr lie Ala Asn Val Ser Ala Ala Gin Asp 
915 920 925 

Tyr Ala Leu Phe Lys Cys Thr Ala Thr Asn Ala Leu Gly Ser Asp His 
930 935 940 

Thr Asn lie Gin Leu Val Ser lie Ser Arg Pro Asp Pro Pro Leu Gly 
945 950 955 960 

Leu Lys Val Val Ser Val Ser Pro His Ser Val Gly Leu Glu Trp Lys 
965 970 975 

Pro Gly Phe Asp Gly Gly Leu Pro Gin Arg Phe Gin lie Arg Tyr Glu 
980 985 990 

Ala Leu Glu Thr Pro Gly Phe Leu Tyr Met Asp Val Leu Pro Ala Gin 
995 1000 1005 

Ala Thr Thr Phe Thr Leu Thr Gly Leu Lys Pro Ser Thr Arg Tyr Arg 
1010 1015 1020 

lie Trp Leu Leu Ala Ser Asn Ala Leu Gly Asp Ser Gly Leu Thr Asp 
1025 1030 1035 1040 

Lys Gly lie Gin Val Ser lie Thr Thr Pro Gly Leu Asp Gin Ala Pro 
1045 1050 1055 

Glu Asp Thr Asp Gin Pro Leu Pro Thr Glu Gin Pro Pro Gly Pro Pro 
1060 1065 1070 

Arg Leu Pro Leu Leu Pro Val Leu Phe Ala Val Gly Gly Leu Leu Leu 
1075 1080 1085 

Leu Ser Asn Ala Ser Cys Val Gly Gly Leu Leu Trp Arg Arg Arg Leu 
1090 1095 1100 

Arg Arg Leu Ala Glu Glu lie Ser Glu Lys Thr Glu Ala Gly Ser Glu 
1105 1110 1115 1120 

Glu Asp Arg lie Arg Asn Glu Tyr Glu Glu Ser Gin Trp Thr Gly Asp 
1125 1130 1135 

Arg Asp Thr Arg Ser Ser Thr Val Ser Thr Ala Glu Val Asp Pro His 
1140 1145 1150 



Tyr Tyr Ser Met Arg Asp Phe Ser Pro Gin Leu Pro Pro Thr Leu Glu 
1155 " 1160 1165 
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Glu Val Ser Tyr Arg Gin Ala Phe Thr Gly He Glu Asp Glu Asp Met 
1170 1175 ~ 1180 

Ala Phe Pro Gly His Leu Tyr Asp Glu Val Glu Arg Val Tyr Gly Pro 
1185 1190 1195 1200 

Pro Gly Val Trp Gly Pro Leu Tyr Asp Glu Val Gin Met Asp Pro Tyr 
1205 1210 1215 

Asp Leu Arg Trp Pro Glu Val Lys Tyr Glu Asp Pro Arg Gly He Tyr 
1220 1225 1230 

Asp Gin Val Ala Ala Asp Met Asp Ala Gly Glu Pro Gly Ser Leu Pro 
1235 1240 1245 

Phe Glu Leu Arg Gly His Leu Val 
1250 1255 



<210> 91 

<211> 29 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 91 

gacctgtact tcattttgga caaatcagg 29 



<210> 92 

<211> 34 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 92 

gagctcaaaa ttgagtggat gatgccttgc agag 3 4 



<210> 93 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 93 

ggtaccactt ctctcaatcc aactttc 27 



<210> 94 

<211> 49 

<212> DNA 

<213> Artificial 



Sequence 
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<220> 

<223> Description of Artificial Sequence: Primer 
<400> 94 

gtccaaaatg aagtacaggt cagccaaact agtagaggtg aacaaaagc 4 9 



<210> 95 

<211> 28 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 95 

gggggacgca gggaggatgg gggtccag 2 8 



<210> 96 

<211> 30 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 96 

gagctcccgt cagaacagtg tgtggtggtg 3 0 



<210> 97 

<211> 46 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 97 

atcctccctg cgtcccccag ccaaactagt agaggtgaac aaaagc 4 6 



<210> 98 
<211> 1650 
<212> DNA 

<213> Unknown Organism 

<220> 
<221> CDS 

<222> (34) . . (1647) 
<220> 

<223> Description of Unknown Organism: ATR- lgA2 fusion 
nucleotide 

<400> 98 

ggtaccactt ctctcaatcc aactttctaa aca atg get tct aaa cct ttc ttg 54 

Met Ala Ser Lys Pro Phe Leu 

1 5 
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tct ctt ctt tct ttg tct ttg ctt ttg ttc acc tct act agt ttg get 102 

Ser Leu Leu Ser Leu Ser Leu Leu Leu Phe Thr Ser Thr Ser Leu Ala 

10 15 20 

gac ctg tac ttc att ttg gac aaa tea gga agt gtg ctg cac cac tgg 150 

Asp Leu Tyr Phe lie Leu Asp Lys Ser Gly Ser Val Leu His His Trp 

25 30 35 

aat gaa ate tat tac ttt gtg gaa cag ttg get cac aaa ttc ate age 198 

Asn Glu lie Tyr Tyr Phe Val Glu Gin Leu Ala His Lys Phe lie Ser 

40 45 50 55 

cca cag ttg aga atg tec ttt att gtt ttc tec acc cga gga aca acc 246 

Pro Gin Leu Arg Met Ser Phe lie Val Phe Ser Thr Arg Gly Thr Thr 
60 65 70 

tta atg aaa ctg aca gaa gac aga gaa caa ate cgt caa ggc eta gaa 2 94 

Leu Met Lys Leu Thr Glu Asp Arg Glu Gin lie Arg Gin Gly Leu Glu 
75 80 85 

gaa etc cag aaa gtt ctg cca gga gga gac act tac atg cat gaa gga 342 

Glu Leu Gin Lys Val Leu Pro Gly Gly Asp Thr Tyr Met His Glu Gly 

90 95 100 

ttt gaa agg gee agt gag cag att tat tat gaa aac aga caa ggg tac 3 90 

Phe Glu Arg Ala Ser Glu Gin lie Tyr Tyr Glu Asn Arg Gin Gly Tyr 

105 110 115 

agg aca gee age gtc ate att get ttg act gat gga gaa etc cat gaa 438 

Arg Thr Ala Ser Val lie lie Ala Leu Thr Asp Gly Glu Leu His Glu 

120 125 130 135 

gat etc ttt ttc tat tea gag agg gag get aat agg tct cga gat ctt 486 

Asp Leu Phe Phe Tyr Ser Glu Arg Glu Ala Asn Arg Ser Arg Asp Leu 
140 145 150 

ggt gca att gtt tac tgt gtt ggt gtg aaa gat ttc aat gag aca cag 534 

Gly Ala lie Val Tyr Cys Val Gly Val Lys Asp Phe Asn Glu Thr Gin 
155 160 165 

ctg gee egg att gcg gac agt aag gat cat gtg ttt ccc gtg aat gac 582 

Leu Ala Arg lie Ala Asp Ser Lys Asp His Val Phe Pro Val Asn Asp 

170 175 180 

ggc ttt cag get ctg caa ggc ate ate cac tea att ttg age tct get 630 

Gly Phe Gin Ala Leu Gin Gly lie lie His Ser lie Leu Ser Ser Ala 

185 190 195 

tec cca acc age cct aag gtc ttc cct etc age ctt gac age acc cct 678 

Ser Pro Thr Ser Pro Lys Val Phe Pro Leu Ser Leu Asp Ser Thr Pro 

200 205 210 215 

caa gat ggt aat gtt gtc gtt get tgc ctt gtc cag ggt ttc ttc cct 726 

Gin Asp Gly Asn Val Val Val Ala Cys Leu Val Gin Gly Phe Phe Pro 
220 225 230 

cag gag cca etc tct gtt acc tgg tct gaa tct gga cag aat gtt acc 774 

Gin Glu Pro Leu Ser Val Thr Trp Ser Glu Ser Gly Gin Asn Val Thr 
235 240 245 



gee aga aac ttc cca cct age cag gat gee tec ggt gac etc tac acc 



822 
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Ala Arg Asn Phe Pro Pro Ser Gin Asp Ala Ser Gly Asp Leu Tyr Thr 
250 255 260 

acc age tct cag etc acc ctt cca gec acc cag tgc cca gat ggt aag 870 

Thr Ser Ser Gin Leu Thr Leu Pro Ala Thr Gin Cys Pro Asp Gly Lys 
265 270 275 

tec gtt acc tgc cat gtt aag cac tac acc aac tec age cag gat gtt 918 

Ser Val Thr Cys His Val Lys His Tyr Thr Asn Ser Ser Gin Asp Val 

280 285 290 295 

act gtt cca tgc cgt gtt cca cca cct cca cca tgc tgc cac cca cgt 966 

Thr Val Pro Cys Arg Val Pro Pro Pro Pro Pro Cys Cys His Pro Arg 

300 305 310 

etc tct ctt cac cgt cct gee ctt gag gac ttg etc ttg ggt tct gaa 1014 

Leu Ser Leu His Arg Pro Ala Leu Glu Asp Leu Leu Leu Gly Ser Glu 

315 320 325 

get aac etc acc tgc acc etc acc ggt etc aga gat gee tct ggt gee 1062 

Ala Asn Leu Thr Cys Thr Leu Thr Gly Leu Arg Asp Ala Ser Gly Ala 
330 335 340 

acc ttc acc tgg acc cca age tct ggt aag age get gtt caa gga cca 1110 

Thr Phe Thr Trp Thr Pro Ser Ser Gly Lys Ser Ala Val Gin Gly Pro 
345 350 355 

cct gag cgt gac etc tgt gga tgc tac tct gtt age tct gtt ctt cct 1158 

Pro Glu Arg Asp Leu Cys Gly Cys Tyr Ser Val Ser Ser Val Leu Pro 

360 365 370 375 

ggt tgt gee cag cct tgg aac cac ggt gag acc ttc acc tgc act get 1206 

Gly Cys Ala Gin Pro Trp Asn His Gly Glu Thr Phe Thr Cys Thr Ala 

380 385 390 

gee cac cca gag ttg aag acc cca ctt acc gee aac ate acc aag tec 1254 

Ala His Pro Glu Leu Lys Thr Pro Leu Thr Ala Asn lie Thr Lys Ser 

395 400 405 

gga aac acc ttc cgt ccc gag gtc cac etc ttg cca cca cca tct gag 1302 

Gly Asn Thr Phe Arg Pro Glu Val His Leu Leu Pro Pro Pro Ser Glu 
410 415 420 

gag ctt gee etc aat gag ctt gtt acc etc acc tgc ctt get cgt gga 1350 

Glu Leu Ala Leu Asn Glu Leu Val Thr Leu Thr Cys Leu Ala Arg Gly 
425 430 435 

ttc age cca aag gat gtt ctt gtt agg tgg ctt cag gga tct cag gag 13 98 

Phe Ser Pro Lys Asp Val Leu Val Arg Trp Leu Gin Gly Ser Gin Glu 

440 445 450 455 

ctt cca cgt gag aag tac etc act tgg get tec cgt cag gag cca age 1446 

Leu Pro Arg Glu Lys Tyr Leu Thr Trp Ala Ser Arg Gin Glu Pro Ser 

460 465 470 

cag gga act acc acc tac get gtt acc age ate ctt cgt gtt get get 1494 

Gin Gly Thr Thr Thr Tyr Ala Val Thr Ser lie Leu Arg Val Ala Ala 

475 480 485 

gag gac tgg aag aag ggt gag acc ttc tec tgc atg gtt ggt cac gag 1542 

Glu Asp Trp Lys Lys Gly Glu Thr Phe Ser Cys Met Val Gly His Glu 
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490 495 500 

gcc ctt cca ctt gcc ttc acc cag aag acc att gat cgt ttg get gga 1590 
Ala Leu Pro Leu Ala Phe Thr Gin Lys Thr lie Asp Arg Leu Ala Gly 
505 510 515 

aag cca acc cac ate aat gtt tct gtt gtc atg get gag get gat gga 163 8 
Lys Pro Thr His lie Asn Val Ser Val Val Met Ala Glu Ala Asp Gly 
520 525 530 535 

acc tgc tac taa 1650 
Thr Cys Tyr 

<210> 99 
<211> 538 
<212> PRT 

<213> Unknown Organism 
<220> 

<223> Description of Unknown Organism: ATR-IgA2 fusion 
amino acid 

<400> 99 

Met Ala Ser Lys Pro Phe Leu Ser Leu Leu Ser Leu Ser Leu Leu Leu 
15 10 15 

Phe Thr Ser Thr Ser Leu Ala Asp Leu Tyr Phe lie Leu Asp Lys Ser 
20 25 30 

Gly Ser Val Leu His His Trp Asn Glu lie Tyr Tyr Phe Val Glu Gin 
35 40 45 

Leu Ala His Lys Phe lie Ser Pro Gin Leu Arg Met Ser Phe lie Val 
50 55 60 

Phe Ser Thr Arg Gly Thr Thr Leu Met Lys Leu Thr Glu Asp Arg Glu 
65 70 75 80 

Gin lie Arg Gin Gly Leu Glu Glu Leu Gin Lys Val Leu Pro Gly Gly 
85 90 95 

Asp Thr Tyr Met His Glu Gly Phe Glu Arg Ala Ser Glu Gin lie Tyr 
100 105 110 

Tyr Glu Asn Arg Gin Gly Tyr Arg Thr Ala Ser Val lie lie Ala Leu 
115 120 125 

Thr Asp Gly Glu Leu His Glu Asp Leu Phe Phe Tyr Ser Glu Arg Glu 
130 135 140 

Ala Asn Arg Ser Arg Asp Leu Gly Ala He Val Tyr Cys Val Gly Val 
145 150 155 160 

Lys Asp Phe Asn Glu Thr Gin Leu Ala Arg He Ala Asp Ser Lys Asp 
165 170 175 

His Val Phe Pro Val Asn Asp Gly Phe Gin Ala Leu Gin Gly He He 
180 185 190 



His Ser He Leu Ser Ser Ala Ser Pro Thr Ser Pro Lys Val Phe Pro 
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Leu Ser Leu Asp 
210 

Leu Val Gin Gly 
225 

Glu Ser Gly Gin 



Ala Ser Gly Asp 
260 

Thr Gin Cys Pro 
275 

Thr Asn Ser Ser 
290 

Pro Pro Cys Cys 
305 

Asp Leu Leu Leu 



Leu Arg Asp Ala 
340 

Lys Ser Ala Val 
355 

Ser Val Ser Ser 
370 

Glu Thr Phe Thr 
385 

Thr Ala Asn lie 



Leu Leu Pro Pro 
420 

Leu Thr Cys Leu 
435 

Trp Leu Gin Gly 
450 

Ala Ser Arg Gin 
465 

Ser lie Leu Arg 



Ser Cys Met Val 
500 

Thr lie Asp Arg 
515 



Ser Thr Pro Gin 
215 

Phe Phe Pro Gin 
230 

Asn Val Thr Ala 
245 

Leu Tyr Thr Thr 



Asp Gly Lys Ser 
280 

Gin Asp Val Thr 
295 

His Pro Arg Leu 
310 

Gly Ser Glu Ala 
325 

Ser Gly Ala Thr 



Gin Gly Pro Pro 
360 

Val Leu Pro Gly 
375 

Cys Thr Ala Ala 
390 

Thr Lys Ser Gly 
405 

Pro Ser Glu Glu 



Ala Arg Gly Phe 
440 

Ser Gin Glu Leu 
455 

Glu Pro Ser Gin 
470 

Val Ala Ala Glu 
485 

Gly His Glu Ala 



Leu Ala Gly Lys 
520 



Asp Gly Asn Val 
220 

Glu Pro Leu Ser 
235 

Arg Asn Phe Pro 
250 

Ser Ser Gin Leu 
265 

Val Thr Cys His 



Val Pro Cys Arg 
300 

Ser Leu His Arg 
315 

Asn Leu Thr Cys 
330 

Phe Thr Trp Thr 
345 

Glu Arg Asp Leu 



Cys Ala Gin Pro 
380 

His Pro Glu Leu 
395 

Asn Thr Phe Arg 
410 

Leu Ala Leu Asn 
425 

Ser Pro Lys Asp 



Pro Arg Glu Lys 
460 

Gly Thr Thr Thr 
475 

Asp Trp Lys Lys 
490 

Leu Pro Leu Ala 
505 

Pro Thr His lie 



Val Val Ala Cys 



Val Thr Trp Ser 
240 

Pro Ser Gin Asp 
255 

Thr Leu Pro Ala 
270 

Val Lys His Tyr 
285 

Val Pro Pro Pro 



Pro Ala Leu Glu 
320 

Thr Leu Thr Gly 
335 

Pro Ser Ser Gly 
350 

Cys Gly Cys Tyr 
365 

Trp Asn His Gly 



Lys Thr Pro Leu 
400 

Pro Glu Val His 
415 

Glu Leu Val Thr 
430 

Val Leu Val Arg 
445 

Tyr Leu Thr Trp 



Tyr Ala Val Thr 
480 

Gly Glu Thr Phe 
495 

Phe Thr Gin Lys 
510 

Asn Val Ser Val 
525 
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Val Met Ala Glu Ala Asp Gly Thr Cys Tyr 
530 535 



<210> 100 
<211> 6602 
<212> DNA 

<213> Unknown Organism 
<220> 

<223> Description of Unknovm Organism: 
pGPTV-kan-ocs-ATR-IgA2 



<400> 100 

ctggccggcg 

ataaaccttt 

ggtttacccg 

caatctgatc 

acaagccgtt 

tctgaattca 

caattctgtt 

gtttcggttc 

ctccgttcaa 

atatattgtg 

attgcgtttt 

ggatatgaaa 

gtagtcttct 

atatattatt 

ttattataat 

attcgacggt 

acaaattgcc 

aggaaactgg 

acctacgatg 

ctgtaggatc 

accatgtacg 

ggcctgtggt 

accggtgagt 

cgttggatgt 

gtttttggtg 

tcattaattt 

gctaagagcg 

agctcaacat 

gacgtaagta 

taggacatgg 

tgatccgcag 

aaaagtgtac 

gccatttcgc 

ttaaggtacc 

tcttctttct 

tttggacaaa 

gttggctcac 

aggaacaacc 

actccagaaa 

tgagcagatt 

gactgatgga 

tcgagatctt 

ggcccggatt 

gcaaggcatc 

tctcagcctt 



ccagatctgg 
tcacgccctt 
ccaatatatc 
atgagcggag 
ttacgtttgg 
ctgctttaat 
gtgcacgttg 
attctaatga 
tttactgatt 
ctgaataggt 
attattacaa 
caactattat 
attgatgagg 
aataattaac 
aatattctta 
atcgcgataa 
ttttcttatc 
tagctgttgt 
gggggcatcg 
cctgaaagcg 
taagcgctta 
ctcaagatgg 
aatattgtac 
taacatctac 
gacccttgag 
ccaccttcac 
aatttggcct 
tgatctcttt 
tccgagtcag 
caaccgggcc 
ccattaacga 
caaacaacgc 
cttttcagaa 
acttctctca 
ttgtctttgc 
tcaggaagtg 
aaattcatca 
ttaatgaaac 
gttctgccag 
tattatgaaa 
gaactccatg 
ggtgcaattg 
gcggacagta 
atccactcaa 
gacagcaccc 



ggaacctgtg 
ttaaatatcc 
ctgtcaaaca 
aattaaggga 
aactgacaga 
gagatatgcg 
taaaaaacct 
atatatcacc 
gtaccctact 
ttatagcgac 
atccaatttt 
ttatctacat 
agtctaaggc 
aataattaat 
caaatataat 
gctcgcggat 
gaccatgtac 
gggcctgtgg 
caccggtgag 
acgttggatg 
cgtttttggt 
atcattaatt 
ggctaagagc 
aaattgcctt 
gaaactggta 
ctacgatggg 
gtaggatccg 
ctcgatcgag 
tttttatttt 
tgaatttcgc 
cttttgaata 
tttacagcaa 
atggataaat 
atccaacttt 
ttttgttcac 
tgctgcacca 
gcccacagtt 
tgacagaaga 
gaggagacac 
acagacaagg 
aagatctctt 
tttactgtgt 
aggatcatgt 
ttttgagctc 
ctcaagatgg 



gttggcatgc 
gattattcta 
ctgatagttt 
gtcacgttat 
accgcaacgt 
agacgcctat 
gagcatgtgt 
cgttactatc 
acttatatgt 
atctatgata 
gagctcggcg 
gttttagatg 
tatagaatta 
atattataat 
tattatattc 
ccctgaaagc 
gtaagcgctt 
tctcaagatg 
taatattgta 
ttaacatcta 
ggacccttga 
tccaccttca 
gaatttggcc 
ttcttatcga 
gctgttgtgg 
gggcatcgca 
cgagctggtc 
ggagattttt 
tctactaatt 
gggtattctg 
gatacgctga 
gaacggaatg 
agccttgctt 
ctaaacaatg 
ctctactagt 
ctggaatgaa 
gagaatgtcc 
cagagaacaa 
ttacatgcat 
gtacaggaca 
tttctattca 
tggtgtgaaa 
gtttcccgtg 
tgcttcccca 
taatgttgtc 



acatacaaat 
ataaacgctc 
aaactgaagg 
gaccccgccg 
tgaaggagcc 
gatcgcatga 
agctcagatc 
gtatttttat 
acaatattaa 
gagcgccaca 
cgccagctgg 
ttatctgatt 
tatatctaaa 
ttatatatat 
gacggtatcg 
gacgttggat 
acgtttttgg 
gatcattaat 
cggctaagag 
caaattgcct 
ggaaactggt 
cctacgatgg 
tgtaggatcc 
ccatgtacgt 
gcctgtggtc 
ccggtgagta 
aatcccattg 
caaatcagtg 
tggtcgttta 
tttctattcc 
cacgccaagc 
cgcgtgacgc 
cctattatat 
gcttctaaac 
ttggctgacc 
atctattact 
tttattgttt 
atccgtcaag 
gaaggatttg 
gccagcgtca 
gagagggagg 
gatttcaatg 
aatgacggct 
accagcccta 
gttgcttgcc 



ggacgaacgg 
ttttctctta 
cgggaaacga 
atgacgcggg 
actcagccga 
tatttgcttt 
cttaccgccg 
gaataatatt 
aatgaaaaca 
ataacaaaca 
acatcatgtt 
atttttatac 
tgattaatat 
atattttata 
gggcaattgt 
gttaacatct 
tggacccttg 
ttccaccttc 
cgaatttggc 
tttcttatcg 
agctgttgtg 
ggggcatcgc 
ctgaaagcga 
aagcgcttac 
tcaagatgga 
atattgtacg 
cttttgaagc 
cgcaagacgt 
tttcggcgtg 
aactttttct 
ctcgctagtc 
tcgcggtgac 
cttcccttaa 
ctttcttgtc 
tgtacttcat 
ttgtggaaca 
tctccacccg 
gcctagaaga 
aaagggccag 
tcattgcttt 
ctaataggtc 
agacacagct 
ttcaggctct 
aggtcttccc 
ttgtccaggg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 
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tttcttccct 
cagaaacttc 
cacccttcca 
caccaactcc 
ccacccacgt 
taacctcacc 
cccaagctct 
ctctgttagc 
ctgcactgct 
aaacaccttc 
tgagcttgtt 
gtggcttcag 
ggagccaagc 
ggactggaag 
cttcacccag 
tgtcatggct 
gatccactag 
gacagacccg 
tgagcttaaa 
cggcatctat 
gaccggatat 
gtatgtgtta 
acaattaata 
ggatattttt 
tgggaaaacc 
tggcgcgcca 
ggaacacgta 
ggctatctgg 
tacatggcga 
tggggcgccc 
gccaaggatc 
tatgaccccc 
acgttgaagg 
agaaaccatt 
ttgtcaaaaa 
tattcactct 
aggttctccg 
cggctgctct 
caagaccgac 
gctggccacg 
ggactggctg 
tgccgagaaa 
tacctgccca 
agccggtctt 
actgttcgcc 
cgatgcctgc 
tggccggctg 
tgaagagctt 
cgattcgcag 
aggatccccc 
cactcagtct 
tgaatttaaa 
tattttatca 
gtcgttttac 
gcacatcccc 
caacagttgc 
cacgttcgcc 
tagtgcttta 
gccatcgccc 
tggactcttg 
ataagggatt 



caggagccac 
ccacctagcc 
gccacccagt 
agccaggatg 
ctctctcttc 
tgcaccctca 
ggtaagagcg 
tctgttcttc 
gcccacccag 
cgtcccgagg 
accctcacct 
ggatctcagg 
cagggaacta 
aagggtgaga 
aagaccattg 
gaggctgatg 
ttctagctag 
ttcttacacc 
gccggcgctg 
gtcgagggca 
gaggccctca 
attttcattt 
aatattcaaa 
attataatat 
ctggcgttac 
agcttcacgc 
gaaagccagt 
acaagggaaa 
tagctagact 
tctggtaagg 
tgatggcgca 
gccgatgacg 
agccactcag 
attgcgcgtt 
tgctccactg 
caatccagat 
gccgcttggg 
gatgccgccg 
ctgtccggtg 
acgggcgttc 
ctattgggcg 
gtatccatca 
ttcgaccacc 
gtcgatcagg 
aggctcaagg 
ttgccgaata 
ggtgtggcgg 
ggcggcgaat 
cgcatcgcct 
gatgagctaa 
ttcatctacg 
cttgcatcaa 
ataaatattt 
aacgtcgtga 
ctttcgccag 
gcagcctgaa 
ggctttcccc 
cggcacctcg 
tgatagacgg 
ttccaaactg 
ttgccgattt 



tctctgttac 
aggatgcctc 
gcccagatgg 
ttactgttcc 
accgtcctgc 
ccggtctcag 
ctgttcaagg 
ctggttgtgc 
agttgaagac 
tccacctctt 
gccttgctcg 
agcttccacg 
ccacctacgc 
ccttctcctg 
atcgtttggc 
gaacctgcta 
agcggccgcc 
ggactgggcg 
agaccatgct 
ttggtggagc 
ctccgcttga 
gcagtgcagt 
tcagattatt 
tgatgataat 
ccaacttaat 
tgccgcaagc 
ccgcagaaac 
acgcaagcgc 
gggcggtttt 
ttgggaagcc 
ggggatcaag 
cgggacaagc 
ccgcgggttt 
caaaagtcgc 
acgttccata 
ctggatcgtt 
tggagaggct 
tgttccggct 
ccctgaatga 
cttgcgcagc 
aagtgccggg 
tggctgatgc 
aagcgaaaca 
atgatctgga 
cgcgcatgcc 
tcatggtgga 
accgctatca 
gggctgaccg 
tctatcgcct 
gctagctata 
gcaatgtacc 
taaatttatg 
aaactatatt 
ctgggaaaac 
ctggcgtaat 
tggcgcccgc 
gtcaagctct 
accccaaaaa 
tttttcgccc 
gaacaacact 
cggaaccacc 
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ctggtctgaa 
cggtgacctc 
taagtccgtt 
atgccgtgtt 
ccttgaggac 
agatgcctct 
accacctgag 
ccagccttgg 
cccacttacc 
gccaccacca 
tggattcagc 
tgagaagtac 
tgttaccagc 
catggttggt 
tggaaagcca 
ctaagatctg 
accgcggtgg 
cgggatagga 
caaggtaggc 
gcgcttcggg 
tcttggcaaa 
attttctatt 
gactgtcatt 
tcactggccg 
cgccttgcag 
actcagggcg 
ggtgctgacc 
aaagagaaag 
atggacagca 
ctgcaaagta 
atcatgagcg 
cgttttacgt 
ctggagttta 
ctaaggtcac 
aattcccctc 
tcgcatgatt 
attcggctat 
gtcagcgcag 
actgcaggac 
tgtgctcgac 
gcaggatctc 
aatgcggcgg 
tcgcatcgag 
cgaagagcat 
cgacggcgat 
aaatggccgc 
ggacatagcg 
cttcctcgtg 
tcttgacgag 
tcatcaattt 
agctgatata 
tttttgcttg 
tctttcaaga 
cctggcgtta 
agcgaagagg 
tcctttcgct 
aaatcggggg 
acttgatttg 
tttgacgttg 
caaccctatc 
atcaaacagg 



tctggacaga 
tacaccacca 
acctgccatg 
ccaccacctc 
ttgctcttgg 

ggtgccacct 

cgtgacctct 
aaccacggtg 
gccaacatca 
tctgaggagc 
ccaaaggatg 
ctcacttggg 
atccttcgtg 
cacgaggccc 
acccacatca 
tgaattcctg 
cgaattaaca 
tattcagatt 
aatgtcctca 
gataccgtgc 
gatatttgac 
cgatctttat 
tgtatcaaat 
tcgttttaca 
cacatccccc 
caagggctgc 
ccggatgaat 
caggtagctt 
agcgaaccgg 
aactggatgg 
gagaattaag 
ttggaactga 
atgagctaag 
tatcagctag 
ggtatccaat 
gaacaagatg 
gactgggcac 
gggcgcccgg 
gaggcagcgc 
gttgtcactg 
ctgtcatctc 
ctgcatacgc 
cgagcacgta 
caggggctcg 
gatctcgtcg 
ttttctggat 
ttggctaccc 
ctttacggta 
ttcttctgag 
atgtattaca 
atcagttatt 
gactataata 
tgggaattaa 
cccaacttaa 
cccgcaccga 
ttcttccctt 
ctccctttag 
ggtgatggtt 
gagtccacgt 
tcgggctatt 
attttcgcct 



atgttaccgc 
gctctcagct 
ttaagcacta 
caccatgctg 
gttctgaagc 
tcacctggac 
gtggatgcta 
agaccttcac 
ccaagtccgg 
ttgccctcaa 
ttcttgttag 
cttcccgtca 
ttgctgctga 
ttccacttgc 
atgtttctgt 
cagcccgggg 
gaggtggatg 
gggatgggat 
gcgtcgagcc 
ttgtaactga 
gcatttatta 
gtaattcgtt 
cgtgtttaat 
acgtcgtgac 
tttcgccagc 
taaaggaagc 
gtcagctact 
gcagtgggct 
aattgccagc 
ctttcttgcc 
ggagtcacgt 
cagaaccgca 
cacatacgtc 
caaatatttc 
tagagtctca 
gattgcacgc 
aacagacaat 
ttctttttgt 
ggctatcgtg 
aagcgggaag 
accttgctcc 
ttgatccggc 
ctcggatgga 
cgccagccga 
tgacccatgg 
tcatcgactg 
gtgatattgc 
tcgccgctcc 
cgggactctg 
cataatatcg 
gaaatatttc 
cctgacttgt 
ttcactggcc 
tcgccttgca 
tcgcccttcc 
cctttctcgc 
ggttccgatt 
cacgtagtgg 
tctttaatag 
cttttgattt 
gctggggcaa 



2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
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accagcgtgg accgcttgct gcaactctct 
ttgcccgtct cactggtgaa aagaaaaacc 
gttattaagt tgtctaagcg tcaatttgtt 
ccaacagctc cccgaccggc agctcggcac 
ag 
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cagggccagg cggtgaaggg caatcagctg 64 2 0 
accccagtac attaaaaacg tccgcaatgt 6480 
tacaccacaa tatatcctgc caccagccag 6540 
aaaatcacca ctcgatacag gcagcccatc 6600 

6602 



<210> 101 
<211> 7129 
<212> DNA 

<213> Unknown Organism 
<220> 

<223> Description of Unknown Organism: 
pGPTV-hpt-ocs-3 5SJ/SC 



<400> 101 

ctgatgggct 

ggctggctgg 

acacattgcg 

aacagctgat 

gtttgcccca 

ataaatcaaa 

cactattaaa 

gcccactacg 

taaatcggaa 

tggcgagaaa 

tgggaagggc 

gctgcaaggc 

acggccagtg 

taacaagtca 

cagaaatatt 

tgcgatatta 

ccggtcggca 

tatcggcgag 

gtgtacgccc 

aagctgcatc 

ggagcatata 

agcgcgtctg 

cgtattggga 

tgtccgtcag 

agtcctcggc 

cgtccatcac 

gccatgtagt 

taagatcggc 

gttcggtttc 

tcaggctctc 

caaagtgccg 

ggacatatcc 

gctgcatcag 

cggtgagttc 

gtgaatatga 

ttgacaagaa 

ggtttctgac 

tcaacgttgc 

ggtcataacg 

ccttggcggc 

cgccccagct 

ccatgtaagc 

gatagccagt 

gtgttccgct 



gcctgtatcg 
tggcaggata 
gacgttttta 
tgcccttcac 
gcaggcgaaa 
agaatagccc 
gaacgtggac 
tgaaccatca 
ccctaaaggg 
ggaagggaag 
gatcggtgcg 
gattaagttg 
aattaattcc 
ggtattatag 
tcaataactg 
tgtgtaatac 
tctactctat 
tacttctaca 
gacagtcccg 
atcgaaattg 
cgcccggagc 
ctgctccata 
atccccgaac 
gacattgttg 
ccaaagcatc 
agtttgccag 
gtattgaccg 
cgcagcgatc 
aggcaggtct 
gctgaatgcc 
ataaacataa 
acgccctcct 
gtcggagacg 
aggctttttc 
gactctaatt 
atatttgcta 
gtatgtgctt 
ggttctgtca 
tgactccctt 
aagaaagcca 
ggcaattccg 
ccactgcaag 
agctgacatt 
tcctttagca 



agtggtgatt 
tattgtggtg 
atgtactggg 
cgcctggccc 
atcctgtttg 
gagatagggt 
tccaacgtca 
cccaaatcaa 
agcccccgat 
aaagcgaaag 
ggcctcttcg 
ggtaacgcca 
catcttgaaa 
tccaagcaaa 
attatatcag 
ataaattgat 
tcctttgccc 
cagccatcgg 
gctccggatc 
ccgtcaacca 
cgcggcgatc 
caagccaacc 
atcgcctcgc 
gagccgaaat 
agctcatcga 
tgatacacat 
attccttgcg 
gcatccatgg 
tgcaacgtga 
ccaatgtcaa 
cgatctttgt 
acatcgaagc 
ctgtcgaact 
atatcttatt 
ggataccgag 
gctgatagtg 
agctcattaa 
gttccaaacg 
aattctccgc 
tccagtttac 
gttcgcttgc 
ctacctgctt 
catccggggt 
gcccttgcgc 



ttgtgccgag 
taaacaaatt 
gtggtttttc 
tgagagagtt 
atggtggttc 
tgagtgttgt 
aagggcgaaa 
gttttttggg 
ttagagcttg 
gagcgggcgc 
ctattacgcc 
gggttttccc 
gaaatatagt 
aacataaatt 
ctggtacatt 
gatatagcta 
tcggacgagt 
tccagacggc 
ggacgattgc 
agctctgata 
ctgcaagctc 
acggcctcca 
tccagtcaat 
ccgcgtgcac 
gagcctgcgc 
ggggatcagc 
gtccgaatgg 
cctccgcgac 
caccctgtgc 
gcacttccgg 
agaaaccatc 
tgaaagcacg 
tttcgatcag 
gcccccctag 
gggaatttat 
accttaggcg 
actccagaaa 
taaaacggct 
tcatgatctt 
tttgcagggc 
tgtccataaa 
tctctttgcg 
cagcaccgtt 
cctgagtgct 



ctgccggtcg 
gacgcttaga 
ttttcaccag 
gcagcaagcg 
cgaaatcggc 
tccagtttgg 
aaccgtctat 
gtcgaggtgc 
acggggaaag 
cattcaggct 
agctggcgaa 
agtcacgacg 
ttaaatattt 
tattgatgca 
gccgtagatg 
gcttagctca 
gctggggcgt 
cgcgcttctg 
gtcgcatcga 
gagttggtca 
cggatgcctc 
gaagaagatg 
gaccgctgtt 
gaggtgccgg 
gacggacgca 
aatcgcgcat 
gccgaacccg 
cggctgcaga 
acggcgggag 
aatcgggagc 
ggcgcagcta 
agattcttcg 
aaacttctcg 
agtcgagatc 
ggaacgtcag 
acttttgaac 
cccgcggctg 
tgtcccgcgt 
gatcccctgc 
ttcccaacct 
accgcccagt 
cttgcgtttt 
tctgcggact 
tgcggcagcg 



gggagctgtt 
caacttaata 
tgagacgggc 
gtccacgctg 
aaaatccctt 
aacaagagtc 
cagggcgatg 
cgtaaagcac 
ccggcgaacg 
gcgcaactgt 
agggggatgt 
ttgtaaaacg 
attgataaaa 
agtttaaatt 
aaagactgag 
tcgggggatc 
cggtttccac 
cgggcgattt 
ccctgcgccc 
agaccaatgc 
cgctcgaagt 
ttggcgacct 
atgcggccat 
acttcggggc 
ctgacggtgt 
atgaaatcac 
ctcgtctggc 
acagcgggca 
atgcaatagg 
gcggccgatg 
tttacccgca 
ccctccgaga 
acagacgtcg 
tggattgaga 
tggagcattt 
gcgcaataat 
agtggctcct 
catcggcggg 
gccatcagat 
taccagaggg 
ctagctatcg 
cccttgtcca 
ggctttctac 
tgaagcttgg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 
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cgcgccagct ggacatcatg ttggatatga aacaactatt atttatctac atgttttaga 2700 
tgttatctga ttatttttat acgtagtctt ctattgatga ggagtctaag gctatagaat 2760 
tatatatcta aatgattaat atatatatta ttaataatta acaataatta atatattata 2820 
atttatatat atatatttta tattattata ataatattct tacaaatata attattatat 2880 
tcgacggtat cggggcaatt gattcccgat cctatctgtc acttcatcaa aaggacagta 2940 
gaaaaggaag gtggcaccta caaatgccat cattgcgata aaggaaaggc tatcattcaa 3000 
gatgcctctg ccgacagtgg tcccaaagat ggacccccac ccacgaggag catcgtggaa 3 060 
aaagaagacg ttccaaccac gtcttcaaag caagtggatt gatgtgatat ctccactgac 312 0 
gtaagggatg acgcacaatc ccactatcct tcgcaagacc cttcctctat ataaggaagt 3180 
tcatttcatt tggagaggac acgctgaaat caccagtctc tctctacaag gtaccatggt 3240 
gctcttcgtg ctcacctgcc tgctggcggt cttcccagcc atctccacga agagtcccat 3300 
atttggtccc gaggaggtga atagtgtgga aggtaactca gtgtccatca cgtgctacta 3 3 60 
cccacccacc tctgtcaacc ggcacacccg gaagtactgg tgccggcagg gagctagagg 342 0 
tggctgcata accctcatct cctcggaggg ctacgtctcc agcaaatatg caggcagggc 3480 
taacctcacc aacttcccgg agaacggcac atttgtggtg aacattgccc agctgagcca 354 0 
ggatgactcc gggcgctaca agtgtggcct gggcatcaat agccgaggcc tgtcctttga 3 600 
tgtcagcctg gaggtcagcc agggtcctgg gctcctaaat gacactaaag tctacacagt 3 660 
ggacctgggc agaacggtga ccatcaactg ccctttcaag actgagaatg ctcaaaagag 3 72 0 
gaagtccttg tacaagcaga taggcctgta ccctgtgctg gtcatcgact ccagtggtta 3780 
tgtgaatccc aactatacag gaagaatacg ccttgatatt cagggtactg gccagttact 3 84 0 
gttcagcgtt gtcatcaacc aactcaggct cagcgatgct gggcagtatc tctgccaggc 3 900 
tggggatgat tccaatagta ataagaagaa tgctgacctc caagtgctaa agcccgagcc 3 960 
cgagctggtt tatgaagacc tgaggggctc agtgaccttc cactgtgccc tgggccctga 402 0 
ggtggcaaac gtggccaaat ttctgtgccg acagagcagt ggggaaaact gtgacgtggt 4 0 80 
cgtcaacacc ctggggaaga gggccccagc ctttgagggc aggatcctgc tcaaccccca 414 0 
ggacaaggat ggctcattca gtgtggtgat cacaggcctg aggaaggagg atgcagggcg 42 0 0 
ctacctgtgt ggagcccatt cggatggtca gctgcaggaa ggctcgccta tccaggcctg 4260 
gcaactcttc gtcaatgagg agtccacgat tccccgcagc cccactgtgg tgaagggggt 432 0 
ggcaggaagc tctgtggccg tgctctgccc ctacaaccgt aaggaaagca aaagcatcaa 43 80 
gtactggtgt ctctgggaag gggcccagaa tggccgctgc cccctgctgg tggacagcga 4440 
ggggtgggtt aaggcccagt acgagggccg cctctccctg ctggaggagc caggcaacgg 4 50 0 
caccttcact gtcatcctca accagctcac cagccgggac gccggcttct act,ggtgtct 4560 
gaccaacggc gatactctct ggaggaccac cgtggagatc aagattatcg aaggagaacc 4620 
aaacctcaag gttcccggga atgtcacggc tgtgctggga gagactctca aggtcccctg 4680 
tcactttcca tgcaaattct cctcgtacga gaaatactgg tgcaagtgga ataacacggg 4740 
ctgccaggcc ctgcccagcc aagacgaagg ccccagcaag gccttcgtga actgtgacga 4800 
gaacagccgg cttgtctccc tgaccctgaa cctggtgacc agggctgatg agggctggta 4 860 
ctggtgtgga gtgaagcagg gccacttcta tggagagact gcagccgtct atgtggcagt 4 92 0 
tgaagagagg aaggcagcgg ggtcccgcga tgtcagccta gcgaaggcag acgctgctcc 4 980 
tgatgagaag gtgctagact ctggttttcg ggagattgag aacaaagcca ttcaggatcc 5040 
caggcttttt gcagagtgaa ttcgttcgta tcatcggttt cgacaacgtt cgtcaagttc 5100 
aatgcatcag tttcattgcg cacacaccag aatcctactg agttcgagta ttatggcatt 5160 
gggaaaactg tttttcttgt accatttgtt gtgcttgtaa tttactgtgt tttttattcg 5220 
gttttcgcta tcgaactgtg aaatggaaat ggatggagaa gagttaatga atgatatggt 52 80 
ccttttgttc attctcaaat taatattatt tgttttttct cttatttgtt gtgtgttgaa 5340 
tttgaaatta taagagatat gcaaacattt tgttttgagt aaaaatgtgt caaatcgtgg 54 00 
cctctaatga ccgaagttaa tatgaggagt aaaacacttg tagttgtcga cggtatcgat 5460 
attaattccc gatcctatct gtcacttcat caaaaggaca gtagaaaagg aaggtggcac 5520 
ctacaaatgc catcattgcg ataaaggaaa ggctatcatt caagatgcct ctgccgacag 5580 
tggtcccaaa gatggacccc cacccacgag gagcatcgtg gaaaaagaag acgttccaac 5640 
cacgtcttca aagcaagtgg attgatgtga tatctccact gacgtaaggg atgacgcaca 5700 
atcccactat ccttcgcaag acccttcctc tatataagga agttcatttc atttggagag 5760 
gacacgctga aatcaccagt ctctctctag agtaccatgg agaaccattt gcttttctgg 582 0 
ggagtcctgg cggtttttat taaggctgtt catgtgaaag cccaagaaga tgaaaggatt 58 80 
gttcttgttg acaacaaatg taagtgtgcc cggattactt ccaggatcat ccgttcttcc 5940 
gaagatccta atgaggacat tgtggagaga aacatccgaa ttattgttcc tctgaacaac 6000 
agggagaata tctctgatcc cacctcacca ttgagaacca gatttgtgta ccatttgtct 6060 
gacctctgta aaaaatgtga tcctacagaa gtggagctgg ataatcagat agttactgct 6120 
acccagagca atatctgtga tgaagacagt gctacagaga cctgctacac ttatgacaga 6180 
aacaagtgct acacagctgt ggtcccactc gtatatggtg gtgagaccaa aatggtggaa 6240 
acagccttaa ccccagatgc ctgctatcct gactgaatcc gcggcgatga gctaagctag 6300 
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ctatatcatc 
gtaccagctg 
ttatgttttt 
atatttcttt 
gtggcgctct 
taatattgta 
taaaaatacg 
atctgagcta 
catgcgatca 
gctccttcaa 

ggcggggtca 

cttcagttta 
agcgtttatt 
tttgtatgtg 



aatttatgta 
atataatcag 
gcttggacta 
caagagctca 
atcatagatg 
catataagta 
atagtaacgg 
cacatgctca 
taggcgtctc 
cgttgcggtt 
taacgtgact 
aactatcagt 
agaataatcg 
catgccaacc 



ttacacataa 
ttattgaaat 
taatacctga 
aaattggatt 
tcgctataaa 
gtagggtaca 
gtgatatatt 
ggttttttac 
gcatatctca 
ctgtcagttc 
cccttaattc 
gtttgacagg 
gatatttaaa 
acaggttccc 
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tatcgcactc 
atttctgaat 
cttgttattt 
tgtaataata 
cctattcagc 
atcagtaaat 
cattagaatg 
aacgtgcaca 
ttaaagcagt 
caaacgtaaa 
tccgctcatg 
atatattggc 
agggcgtgaa 
cagatctggc 



agtctttcat 
ttaaacttgc 
tatcaataaa 
aaacgcaatt 
acaatatatt 
tgaacggaga 
aaccgaaacc 
acagaattga 
gaattcagat 
acggcttgtc 
atcagattgt 
gggtaaacct 
aaggtttatc 
gccggccag 



ctacggcaat 
atcaataaat 
tatttaaact 
gtttgttatt 
gttttcattt 
atattattca 
ggcggtaagg 
aagcaaatat 
cggctgagtg 
ccgcgtcatc 
cgtttcccgc 
aagagaaaag 
cgttcgtcca 



6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7129 



